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Related Applications 
This application is a nonpro visional application which claims priority under 35 



U.S.C. § 119(e) to U.S. Provisional Patent Application Number 60/532,004, entitled DEFINITrVE 
ENDODERM, filed December 23, 2003, and which clahns priority under 35 U.S.C. § 119(e) to 
U.S. Provisional Patent Application Number 60/586,566, entitled CHEMOKINE CELL SURFACE 
RECEPTOR FOR THE ISOLATION OF DEFINITIVE ENDODERM, filed July 9, 2004, and 
which claims priority under 35 U.S.C. § 119(e) to U.S. Provisional Patent Application Number 
60/587,942, entitled CHEMOKINE CELL SURFACE RECEPTOR FOR THE ISOLATION OF 
DEFINITIVE ENDODERM, filed July 14, 2004. The disclosure of each of the above-listed 
priority applications is incorporated herein by reference in its entirety. 



[0002] The present invention relates to the fields of medicine and cell biology. In 
particular, the present invention relates to compositions comprising mammalian definitive 
endoderm cells and methods of making, isolating and using such cells. 



[0003] Human pluripotent stem cells, such as embryonic stem (ES) cells and 
embryonic germ (EG) cells, were first isolated in culture without fibroblast feeders in 1994 
(Bongso et al., 1994) and with fibroblast feeders (Hogan, 1997). Later, Thomson, Reubinoff and 
Shamblott established continuous cultures of human ES and EG cells using mitotically inactivated 
mouse feeder layers (Reubinoff et al., 2000; Shamblott et aL, 1998; Thomson et al., 1998). 

[0004] Human ES and EG cells (hESGs) offer unique opportunities for investigating 
early stages of human development as well as for therapeutic intervention in several disease states, 
such as diabetes mellitus and Parlcinson's disease. For example, the use of insulin-producing P- 
cells derived fi-om hESCs would offer a vast improvement over current cell therapy procedures 
which utilize cells fi-om donor pancreases. However, presently it is not known how to generate an 
insulin-producing P-cell firom hESCs. As such, current cell therapy treatments for diabetes 
mellitus, which utilize islet cells fi-om donor pancreases, are limited by the scarcity of high quality 
islet cells needed for transplant. Cell therapy for a single Type I diabetic patient requires a 
transplant of approximately 8 x 10* pancreatic islet cells. (Shapiro et al., 2000; Shapiro et al., 
2001a; Shapiro et al., 2001b). As such, at least two healthy donor organs are required to obtain 
sufficient islet cells for a successfiil transplant. HESCs offer a source of starting material from 
which to develop substantial quantities of high quality differentiated cells for human cell therapies. 
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[0005] Two properties that make hESCs uniquely suited to cell therapy applications 
are pluripotence and the ability to maintain these cells in culture for prolonged periods without 
accumulation of genetic changes. Pluripotency is defined by the ability of hESCs to differentiate to 
derivatives of all 3 primary germ layers (endoderm, mesoderm, ectoderm) which, in tum, form all 
cell somatic types of the mature organism in addition to extraembryonic tissues (e.g. placenta) and 
germ cells. Although pluripotency imparts extraordinary utility upon hESCs, this property also 
poses unique challenges for the study and manipulation of these cells and their derivatives. Owing 
to the large variety of cell types that may arise in differentiating hESC cultures, the vast majority of 
cell types are produced at very low efficiencies. Additionally, success in evaluating production of 
any given cell type depends critically on defining appropriate markers. Achieving efficient, 
directed differentiation is of great importance for therapeutic application of hESCs. 

[0006] In order to use hESCs as a starting material to generate cells that are useful in 
cell therapy applications, it would be advantageous to overcome the foregoing problems. For 
example, in order to achieve the level of cellular material required for islet cell transplantation 
therapy, it would be advantageous to efficiently direct hESCs toward the pancreatic islet/p-cell 
lineage at the very earliest stages of differentiation. 

[0007] In addition to efficient direction of the differentiation process, it would also be 
beneficial to isolate and characterize intermediate cell types along the differentiation pathway 
towards the pancreatic islet/p-cell lineage and to use such cells as appropriate lineage precursors 
for further steps in the differentiation. 

Summary of the Invention 

[0008] Some embodiments of the present invention relate to cell cultures comprising 
definitive endoderm cells, wherein the definitive endoderm cells are multipotent cells that can 
differentiate into cells of the gut tube or organs derived fi-om the gut tube. In accordance with 
certain embodiments, the definitive endoderm cells are mammalian cells, and in a preferred 
embodiment, the definitive endoderm cells are human cells. In some embodiments of the present 
invention, definitive endoderm cells express or fail to significantly express certain markers. In 
some embodiments, one or more markers selected from SOX 17, CXCR4, MIXLl, GATA4, 
HNF3b, GSC, FGF17, VWF, CALCR, FOXQl, CMKORl and CRIPl are expressed in defmitive 
endoderm cells. In other embodiments, one or more markers selected from OCT4, alpha- 
fetoprotein (AFP), Thrombomodulin (TM), SPARC and SOX7 are not significantly expressed in 
definitive endoderm cells. 

(0009] In accordance with other embodiments of the present invention, methods of 
producing definitive endoderm fi-om pluripotent cells are described. In some embodiments, 
pluripotent cells are derived from a morula. In some embodiments, pluripotent stem cells are stem 
cells. Stem cells used in these methods can include, but are not limited to, embryonic stem cells. 
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Embryonic stem cells can be derived from the embryonic inner cell mass or from the embryonic 
gonadal ridges. Embryonic stem cells can originate from a variety of animal species including, but 
not limited to, various mammalian species including himians. In a preferred embodiment, human 
embryonic stem cells are used to produce definitive endoderm. 

[0010] In some embodiments of the present invention, one or more growth factors are 
used in the differentiation process from pluripotent cell to definitive endoderm cell. The one or 
more growth factors used in the differentiation process can include growth factors from the TGFp 
superfamily. In such embodiments, the one or more growth factors comprise the Nodal/Activin 
and/or the BMP subgroups of the TGFp superfamily of growth factors. In some embodiments, the 
one or more growth factors are selected from the group consisting of Nodal, Activin A, Activin B, 
BMP4, Wnt3a or combinations of any of these growth factors. 

[0011] Embodiments of the present invention also relate to populations of cells 
enriched in definitive endoderm cells, hi certain embodiments, the definitive endoderm cells are 
isolated or substantially purified. In some embodiments, the isolated or substantially purified 
definitive endoderm cells express the SOX 17 and/or the CXRC4 marker to a greater extent than the 
OCT4, AFP, TM, SPARC and/or SOX7 markers. 

[0012] Methods for enriching a cell population with definitive endoderm are also 
contemplated. In some embodiments, definitive endoderm cells can be isolated or substantially 
purified from a mixed cell population by contacting the cells with a reagent that binds to a 
molecule that is present on the surface of definitive endoderm cells but which is not present on the 
surface of other cells in the mixed cell population, and then isolating the cells bound to the reagent. 
In certain embodiments, the molecule that is present on the surface of definitive endoderm cells is 
CXCR4. 

[0013] Still other embodiments of the present invention relate to CXCR4 antibodies, 
SDF-1 ligands or other ligands for CXCR4 can be used to obtain definitive endoderm cells in an 
enriched, isolated or substantially purified form. For example, a CXCR4 antibody, an SDF-1 
ligand or another ligand for CXCR4 can be used as a reagent in a method, such as affinity-based 
separation or magnetic-based separation, to enrich, isolate or substantially purify preparations of 
definitive endoderm cells which bind to the reagent. 

[0014] Other embodiments of the invention described herein relate to compositions, 
such as cell cultures, which comprise pluripotent cells and definitive endoderm cells, hi certain 
embodiments, the cell cultures comprise both stem cells and definitive endoderm cells. The 
number of stem cells present in such cultures can be greater than, equal to or less than the number 
of definitive endoderm cells in the culture, hi some embodiments, the stem cells are human 
embryonic stem cells. In certain embodiments the hESCs are maintained on a feeder layer. In such 
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embodiments, the feeder layer cells can be cells, such as fibroblasts, which are obtained from 
humans, mice or any other suitable organism. 

[0015] In some embodiments of the present invention, the compositions comprising 
definitive endoderm cells and hESCs also includes one or more growth factors. Such growth 
factors can include growth factors from the TGFp superfamily. In such embodiments, the one or 
more growth factors comprise the Nodal/Activin and/or the BMP subgroups of the TGFp 
superfamily of growth factors. In some embodiments, the one or more growth factors are selected 
from the group consisting of Nodal, Activin A, Activin B, BMP4, Wnt3a or combinations of any of 
these groAvth factors. 

[0016] Other embodiments of the present inventions are described with reference to 
the numbered paragraphs below: 

[0017] 1. A cell culture comprising human cells wherein at least about 10% of said 
human cells are definitive endoderm cells, said definitive endoderm cells being multipotent cells 
that can differentiate into cells of the gut tube or organs derived therefrom. 

[0018] 2. The cell culture of paragraph 1, wherein at least about 50% of said human 
cells are definitive endoderm cells. 

[0019] 3. The cell culture of paragraph 1, wherein at least about 80% of said human 
cells are definitive endoderm cells. ^ 

[0020] 4. The cell culture of paragraph 1, wherein said definitive endoderm cells 
express a marker selected from the group consisting of SOX 17 and CXCR4. 

[0021] 5. The cell culture of paragraph 4, wherein the expression of a marker 
selected from the group consisting of SOX 17 and CXCR4 is greater than the expression of a 
marker selected from the group consisting of OCT4, alpha-fetoprotein (AFP), Thrombomodulin 
(TM), SPARC and SOX7 in said definitive endoderm cells. 

[0022] 6. The cell culture of paragraph 4, wherein said defmitive endoderm cells do 
not express a marker selected from the group consisting of OCT4, AFP, TM, SPARC and SOX7. 

[0023] 7. The cell culture of paragraph 4, wherein said defmitive endoderm cells 
express a marker selected from the group consisting of MDCLl, GATA4 and HNF3b. 

[0024] 8. The cell culture of paragraph 4, wherein said definitive endoderm cells 
express a marker selected from the group consisting of FGF17, VWF, CALCR, FOXQl, CMKORl 
andCREPl. 

[0025] 9. The cell culture of paragraph 1, wherein said definitive endoderm cells 
express SOX17 and CXCR4. 

[0026] 10. The cell culture of paragraph 9, wherein the expression of SOX17 and 
CXCR4 is greater than the expression of OCT4, AFP, TM, SPARC and SOX7 in said defmitive 
endoderm cells. 
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[0027] 11. The cell culture of paragraph 9, wherein said definitive endoderm cells do 
not express OCT4, AFP, TM, SPARC and SOX7, 

[0028] 12. The cell culture of paragraph 9, wherein said definitive endoderm cells 
express MDflLl, GATA4 and HNF3h. 

[0029] 13. The cell culture of paragraph 9, wherein said definitive endoderm cells 
express a marker selected fi-om the group consisting of FGF17, VWF, CALCR, FOXQl, CMKORl 
andCRIPl, 

[0030] 14. The cell culture of paragraph 1, wherein at least about 2 definitive 
endoderm cells are present for about every 1 pluripotent cell in said cell culture. 

[0031] 15. The cell culture of paragraph 14, wherein said pluripotent cell comprises 
an embryonic stem cell. 

[0032] 16. The cell culture of paragraph 15, wherein said embryonic stem cell is 
derived from a tissue selected fi-om the group consisting of the morula, the inner cell mass (ICM) of 
an embryo and the gonadal ridges of an embryo. 

[0033] 17. The cell culture of paragraph 1 ftjrther comprising a medium which 
comprises less than about 10% serum. 

[0034] 18. The cell culture of paragraph 1 further comprising a growth factor of the 
Nodal/Activin subgroup of the TGFp superfamily. 

[0035] 19; The cell culture of paragraph 1, further comprising a growth factor 
selected from the group consisting of Nodal, Activin A, Activin B and combinations thereof 

[0036] 20. A cell population comprising cells wherein at least about 90% of said cells 
are human definitive endoderm cells, said human defmitive endoderm cells being multipotent cells 
that can differentiate into cells of the gut tube or organs derived therefrom. 

[0037] 21. The cell population of paragraph 20, wherein at least about 95% of said 
cells are human definitive endoderm cells. 

[0038] 22. The cell population of paragraph 20, wherein at least about 98% of said 
cells are human definitive endoderm cells. 

[0039] 23. The cell population of paragraph 20, wherein said human definitive 
endoderm cells express a marker selected from the group consisting of SOX17 and CXCR4. 

[0040] 24. The cell population of paragraph 23, wherein the expression of a marker 
selected from the group consisting of SOX 17 and CXCR4 is greater than the expression of a 
marker selected from the group consisting of OCT4, AFP, TM, SPARC and SOX7 in said human 
definitive endoderm cells. 

[0041] 25. The cell population of paragraph 23, wherein said himian definitive 
endoderm cells do not express a marker selected from the group consisting of OCT4, AFP, TM, 
SPARC and SOX7. 
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[0042] 26. The cell population of paragraph' 23, wherein said human definitive 
endoderm cells express a marker selected from the group consisting of MDCLl, GATA4 and 
HNF3b. 

[0043] 27. The cell population of paragraph 23, wherein said definitive endoderm 
cells express a marker selected from the group consisting of FGF17, VWF, CALCR, FOXQl, 
CMKORl andCRIPl. 

[0044] 28. The cell population of paragraph 20, wherein said human definitive 
endoderm cells express SOX 1 7 and CXCR4. 

[0045] 29. The cell population of paragraph 28, wherein the expression of SOX17 and 
CXCR4 is greater than the expression of OCT4, AFP, TM, SPARC and SOX7 in said human 
definitive endoderm cells. 

[0046] 30. The cell population of paragraph 28, wherein said human definitive 
endoderm cells do not express OCT4, AFP, TM, SPARC and SOX7. 

[0047] 31. The cell population of paragraph 28, wherein said human definitive 
endoderm cells express MIXLl, GATA4 and HNF3b. 

[0048] 32. The cell population of paragraph 28, wherein said definitive endoderm 
cells express a marker selected from the group consisting of FGF17, VWF, CALCR, FOXQl, 
CMKORl andCRIPl. 

[0049] 33. The cell population of paragraph 20, wherein at least about 2 human 
definitive endoderm cells are present for about every 1 pluripotent cell in said cell population. 

[0050] 34. The cell population of paragraph 33, wherein said pluripotent cell 
comprises an embryonic stem cell. 

[005 1] 35. The cell population of paragraph 34, wherein said embryonic stem cell is 
derived from a tissue selected from the morula, the ICM of an embryo and the gonadal ridges of an 
embryo. 

[0052] 36. A method of producing definitive endoderm cells, said method comprising 
the steps of: 

[0053] obtaining a cell population comprising pluripotent human cells; 

[0054] providing said cell population with at least one growth factor of the TGFp 
superfamily in an amount sufficient to promote differentiation of said pluripotent cells to definitive 
endoderm cells, said definitive endoderm cells being multipotent cells that can differentiate into 
cells of the gut tube or organs derived therefrom; and 

[0055] allowing sufficient time for definitive endoderm cells to form, wherein said 
sufficient time for definitive endoderm cells to form has been determined by detecting the presence 
of definitive endoderm cells in said cell population. 
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[0056] 37. The method of paragraph 36, wherein at least about 10% of said 
pliiripotent cells differentiate into definitive endoderm cells. 

[0057] 38. The method of paragraph 36, wherein at least about 50% of said 
pluripotent cells differentiate into definitive endoderm cells. 

[0058] 39. The method of paragraph 36, wherein at least about 70% of said 
pluripotent cells differentiate into definitive endoderm cells. 

[0059] 40. The method of paragraph 36, wherein at least about 80% of said 
pluripotent cells differentiate into definitive endoderm cells. 

[0060] 41. The method of paragraph 36, wherein detecting the presence of definitive 
endoderm cells in said cell population comprises detecting the expression of at least one marker 
selected firom the group consisting of SOX 17 and CXCR4 and at least one marker from the group 
consisting of OCT4, AFP, TM, SPARC and SOX7 in cells of said cell population, wherein the 
expression of a marker selected from the group consisting of SOX 17 and CXCR4 is greater than 
the expression of a marker selected from the group consisting of OCT4, AFP, TM, SPARC and 
SOX7 in said definitive endoderm cells. 

[0061] 42. The method of paragraph 36, wherein detecting the presence of definitive 
endoderm cells in said cell population comprises detecting the expression of at least one marker 
selected from the group consisting of SOX17 and CXCR4 and at least one marker from the group 
consisting of AFP, TM, and SOX7 in cells of said cell population, wherein the expression of a 
marker selected from the group consisting of SOX 17 and CXCR4 is greater than the expression of 
a marker selected from the group consisting of AFP, TM, and SOX7 in said defmitive endoderm 
cells. 

[0062] 43 . The method of paragraph 42, wherein the expression of at least one of said 
markers is determined by Q-PCR. 

[00631 44. The method of paragraph 42, wherein the expression of at least one of said 
markers is determined by immunocytochemistry. 

[0064] 45. The method of paragraph 36, wherein detecting the presence of defmitive 
endoderm cells in said cell population comprises detecting the expression of at least one marker 
selected from the group consisting of FGF17, VWF, CALCR, FOXQl, CMKORl and CRIPl and 
at least one marker from the group consisting of OCT4, AFP, TM, SPARC and SOX7 in cells of 
said cell population, wherein the expression of a marker selected from the group consisting of 
FGF17, VWF, CALCR, FOXQl, CMKORl and CRIPl is greater than the expression of a marker 
selected from the group consisting of OCT4, AFP, TM, SPARC and SOX7 in said defmitive 
endoderm cells. 

[0065] 46. The method of paragraph 36, wherein said at least one growth factor is of 
the Nodal/ Activin subgroup of the TGFP superfamily. 
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[0066] 47. The method of paragraph 46, wherein said at. least one growth factor is 
selected from the group consisting of Nodal Activin A, Activin B and combinations thereof 

[0067] 48. The method of paragraph 47, wherein said at least one growth factor is 

Nodal. 

[0068] 49. The method of paragraph 47, wherein said at least one growth factor is 
Activin A. 

[0069] 50. The method of paragraph 47, wherein said at least one growth factor is 
Activin B. 

[0070] 51. The method of paragraph 36, wherein a plurality of growth factors of the 
TGFp superfamily is provided. 

[0071] 52. The method of paragraph 51, wherein said plurality of growth factors 
comprises Nodal Activin A and Activin B. 

[0072] 53. The method of paragraph 36, wherein said at least one growth factor is 
provided in a concentration of at least about 10 ng/ml. 

[0073] 54. The method of paragraph 36, wherein said at least one growth factor is 
provided in a concentration of at least about 100 ng/ml. 

[0074] 55. The method of paragraph 36, wherein said at least one growth factor is 
provided in a concentration of at least about 500 ng/ml. 

[0075] 56. The method of paragraph 36, wherein said at least one growth factor is 
provided in a concentration of at least about 1000 ng/ml, 

[0076] 57. The method of paragraph 36, wherein said at least one growth factor is 
provided in a concentration of at least about 5000 ng/ml. 

[0077] 58. The method of paragraph 36, wherein said cell population is grown in a 
medium comprising less than about 10% serum, 

[0078] 59. The method of paragraph 36, wherein said pluripotent cells comprise stem 

cells. 

[0079] 60. The method of paragraph 59, wherein said pluripotent cells comprise 
embryonic stem cells. 

[0080] 61. The method of paragraph 60, wherein said embryonic stem cells are 
derived from a tissue selected from the group consisting of the morula, the ICM of an embryo and 
the gonadal ridges of an embryo. 

[0081] 62, A definitive endoderm cell produced by the method of paragraph 36. 

[0082] 63. A method of producing a cell population enriched in definitive endoderm 
cells, said method compriising the steps of: 
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[0083] differentiating cells in a population of pluripotent human cells so as to produce 
definitive endoderm cells, said definitive endoderm cells being multipotent cells that can 
differentiate into cells of the gut tube or organs derived therefi-om; 

[0084] providing to said cell population a reagent which binds to a marker expressed 
in said definitive endoderm cells but which is not substantially expressed in other cell types present 
in said cell population; and 

[0085] separating said definitive endoderm cells bound to said reagent from said other 
cell types present in said cell population, thereby producing a cell population enriched in definitive 
endoderm cells. 

[0086] 64. The method of paragraph 63, wherein the differentiating step further 
comprises obtaining a cell population comprising pluripotent human cells, providing said cell 
population with at least one growth factor of the TGpp superfamily in an amount sufficient to 
promote differentiation of said pluripotent cells to definitive endoderm cells, said definitive 
endoderm cells being multipotent cells that can differentiate into cells of the gut tube or organs 
derived therefrom, and allowing sufficient time for definitive endoderm cells to form, wherein said 
sufficient time for definitive endoderm cells to form has been determined by detecting the presence 
of definitive endoderm cells in said cell population. 

[0087] 65. The method of paragraph 63, wherein detecting comprises detecting the 
expression of at least one marker selected from the group consisting of SOX 17 and CXCR4 and at 
least one marker from the group consisting of OCT4, AFP, TM, SPARC and SOX7 in cells of said 
cell population, wherein the expression of a marker selected from the group consisting of SOX 17 
and CXCR4 is greater than the expression of a marker selected from the group consisting of OCT4, 
AFP, TM, SPARC and SOX7 in said definitive endoderm cells. 

[0088] 66. The method of paragraph 63, wherein detecting comprises detecting the 
expression of at least one marker selected from the group consisting of SOX 17 and CXCR4 and at 
least one marker from the group consisting of AFP, TM, and SOX7 in cells of said cell population, 
wherein the expression of a marker selected from the group consisting of SOX 17 and CXCR4 is 
greater than the expression of a marker selected from the group consisting of AFP, TM, and SOX7 
in said definitive endoderm cells. 

[0089] 67, The method of paragraph 63, wherein detecting comprises detecting the 
expression of at least one marker selected from the group consisting of FGF17, VWF, CALCR, 
FOXQl, CMKORl and CRIPl and at least one marker from the group consisting of OCT4, AFP, 
TM, SPARC and SOX7 in cells of said cell population, wherein the expression of a marker 
selected from the group consisting of FGF17, VWF, CALCR, FOXQl, CMKORl and CRIPl is 
greater than the expression of a marker selected from the group consisting of OCT4, AFP, TM, 
SPARC and SOX7 in said definitive endoderm cells. 
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[0090] 68. The method of paragraph 63, wherein at least about 95% of said cells are 
definitive endoderm cells. 

[0091] 69. The method of paragraph 63, wherein at least about 98% of said cells are 
definitive endoderm cells. 

[0092] 70. The method of paragraph 63, wherein said marker is CXCR4. 

[0093] 71 . The method of paragraph 63, wherein said reagent is an antibody 

[0094] 72. The method of paragraph 71, wherein said antibody has affinity for 
CXCR4. 

[0095] 73. An enriched population of definitive endoderm cells produced by the 
method of paragraph 63. 

[0096] 74. The cell culture of any one of paragraphs 4 or 9, wherein said definitive 
endoderm cells do not significantly express a marker selected from the group consisting of OCT4, 
AFP, TM, SPARC and SOX7. 

[0097] 75. The cell population of any one of paragraphs 23 or 28, wherein said 
definitive endoderm cells do not significantly express a marker selected from the group consisting 
of OCT4, AFP, TM, SPARC and SOX7. 

[0098] It will be appreciated that the methods and compositions described above 
relate to cells cultured in vitro. However, the above-described in vitro differentiated cell 
compositions may be used for in vivo applications. 

[0099] Additional embodiments of the present inventions may also be found in United 
States Provisional Patent Application No. 60/532,004, entitled DEFINrnVE ENDODERM, filed 
December 23, 2003; U.S. Provisional Patent Application Number 60/586,566, entitled 
CHEMOKBSTE CELL SURFACE RECEPTOR FOR THE ISOLATION OF DEFINITIVE 
ENDODERM, filed July 9, 2004; and U.S. Provisional Patent Application Number 60/587,942, 
entitled CHEMOKINE CELL SURFACE RECEPTOR FOR THE ISOLATION OF DEFINITIVE 
ENDODERM, filed July 14, 2004, the disclosures of which are incorporated herein by reference in 
their entireties. 

Brief Description of the Drawings 
[0100] Figure 1 is a schematic of a proposed differentiation pathway for the 
production of beta-cells from hESCs. The first step in the pathway commits the ES cell to the 
definitive endoderm lineage and represents one of the earliest known steps in the further 
differentiation of ES cells to pancreatic endoderm, endocrine endoderm, or islet/beta-cell. Some 
factors useful for mediating this transition are members of the TGPp family which include, but are 
not limited to, activins, nodals and BMPs. Exemplary markers for defining the definitive 
endoderm target cell are SOX17, GATA4, HNF3b, MDCl and CXCR4. 
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[0101] Figure 2 is a diagram of the human SOX17 cDNA which displays the positions 
of conserved motifs and highlights the region used for the immunization procedure by GENOVAC. 

[0102] Figure 3 is a relational dendrogram illustrating that SOX17 is most closely 
related to SOX7 and somewhat less to SOX18. The SOX17 proteins are more closely related 
among species homologs than to other members of the SOX group F subfamily within the same 
species. 

[0103] Figure 4 is a Western blot probed with the rat anti-SOX17 antibody. This blot 
demonstrates the specificity of this antibody for human SOX 17 protein over-expressed in 
fibroblasts (lane 1) and a lack of immunoreactivity with EGFP (lane 2) or the most closely related 
SOX family member, SOX7 Oane 3). 

[0104] Figures 5 A-B are micrographs showing a cluster of SOXl 7* cells that display a 
significant number of AFP^ co-labeled cells (A). This is in striking contrast to other SOX 17^ 
clusters (B) where little or no AFF^ cells are obiserved. 

[0105] Figures 6A-C are micrographs showing parietal endoderm and SOX17, Panel 
A shows immunocytochemistry for human Thrombomodulin (TM) protein located on the cell 
surface of parietal endoderm cells in randomly differentiated cultures of hES cells. Panel B is the 
identical field shown in A double-labeled for TM and SOX17. Panel C is the phase contrast image 
of the same field with DAPI labeled nuclei. Note the complete correlation of DAPI labeled nuclei 
and SOX 17 labeling. 

[0106] Figures 7 A-B are bar charts showing SOX 17 gene expression by quantitative 
PGR (Q-PCR) and anti-SOX17 positive cells by SOX17-specific antibody. Panel A shows that 
Activin A increases SOX 17 gene expression while retinoic acid (RA) strongly suppresses SOX 17 
expression relative to the undifferentiated control media (SR20). Panel B shows the identical 
pattern as well as a similar magnitude of these changes is reflected in SOXH"" cell number, 
indicating that Q-PCR measurement of SOX 17 gene expression is very reflective of changes at the 
single cell level. 

[0107] Figure 8A is a bar chart which shows that a culture of differentiating hESCs in 
the presence of Activin A maintains a low level of AFP gene expression while cells allowed to 
randomly differentiate in 10% fetal bovine serum (FBS) exhibit a strong upregulation of AFP. The 
difference in expression levels is approximately 7-fold. 

[0108] Figures 8B-C are images of two micrographs showing that the suppression of 
AFP expression by Activin A is also evident at the single cell level as indicated by the very rare 
and small clusters of AFP^ cells observed in Activin A treatment conditions (bottom) relative to 
10% FBS alone (top). 

[0109] Figures 9A-B are comparative images showing the quantitation of the AFP* 
cell number using flow cytometry. This figure demonstrates that the magnitude of change in AFP 
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gene expression (Figure 8A) in the presence (right panel) and absence (left panel) of Activin A 
exactly corresponds to the number of AFP* cells, fiirther supporting the utility of Q-PCR analyses 
to indicate changes occurring at the individual cell level. 

[0110] Figures lOA-F are micrographs which show that exposure of hESCs to nodal, 
Activin A and Activin B (NAA) yields a striking increase in the number of SOX 17* cells over the 
period of 5 days (A-C). By comparing to the relative abundance of SOX 17* cells to the total 
number of cells present in each field, as indicated by DAPI stained nuclei (D-F), it can be seen that 
approximately 30-50% of all cells are immunoreactive for SOX 17 after five days treatment with 
NAA. 

[0111] Figure 11 is a bar chart which demonstrates that Activin A (0, 10, 30 or 100 
ng/mL) dose-dependently increases SOX17 gene expression in differentiating hESCs. Increased 
expression is already robust after 3 days of treatment on adherent cultures and continues through 
subsequent 1, 3 and 5 days of suspension culture as well. 

[0112] Figures 12 A-C are bar charts which demonstrate the effect of Activin A on the 
expression of MIXLl (panel A), GATA4 (panel B) and HNF3b (panel C). Activin A dose- 
dependent increases are also observed for three other markers of definitive endoderm; MIXLl, 
GATA4 and HNF3b. The magnitudes of increased expression in response to activin dose are 
strikingly similar to those observed for SOX 17, strongly indicating that Activin A is specifying a 
population of cells that co-express all four genes (SOX17^, MIXLl*, GATA4'^ and HNF3b*). 

[0113] Figures 13 A-C are bar charts which demonstrate the effect of Activin A on the 
expression of AFP (panel A), SOX7 (panel B) and SPARC (panel C). There is an Activin A dose- 
dependent decrease in expression of the visceral endoderm marker AFP. Markers of primitive 
endoderm (SOX7) and parietal endoderm (SPARC) remain either unchanged or exhibit suppression 
at some time points indicating that Activin A does not act to specify these extra-embryonic 
endoderm cell types. This further supports the fact that the increased expression of SOX17, 
MIXLl, GATA4, and HlSfF3b are due to an increase in the number of definitive endoderm cells in 
response to Activin A. 

[0114] Figures 14A-B are bar charts showing the effect of Activin A on ZICl (panel 
A) and Brachyury expression (panel B) Consistent expression of the neural marker ZICl 
demonstrates that there is not a dose-dependent effect of Activin A on neural differentiation. There 
is a notable suppression of mesoderm differentiation mediated by 100 ng/mL of Activin A 
treatment as indicated by the decreased expression of brachyury. This is likely the result of the 
increased specification of definitive endoderm from the mesendoderm precursors. Lower levels of 
Activin A treatment (10 and 30 ng/mL) maintain the expression of brachyury at later time points of 
differentiation relative to untreated control cultures. 
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[0115] Figures 15A-B are micrographs showing decreased parietal endoderm 
difTerentiation in response to treatment with activins. Regions of TM*" parietal endoderm are found 
through the culture (A) when differentiated in serum alone, while differentiation to TM* cells is 
scarce when activins are included (B) and overall intensity of TM immunoreactivity is lower. 

[0116] Figures 16A-D are micrographs which show marker expression in response to 
treatment with Activin A and Activin B. hESCs were treated for four consecutive days with 
Activin A and Activin B and triple labeled with SOX17, AFP and TM antibodies. Panel A - 
SOX17; Panel B - AFP; Panel C - TM; and Panel D - Phase/DAPL Notice the numerous SOX17 
positive cells (A) associated with the complete absence of AFP (B) and TM (C) immunoreactivity. 

[0117] Figure 17 is a micrograph showing the appearance of definitive endoderm and 
visceral endoderm in vitro from hESCs. The regions of visceral endoderm are identified by 
AFP*"/SOX17^" while definitive endoderm displays the complete opposite profile, SOX17*^/AFP*°^'. 
This field was selectively chosen due to the proximity of these two regions to each other. However, 
there are numerous times when SOX17'^/AFP*°^' regions are observed in absolute isolation from any 
regions of AFP*" cells, suggesting the separate origination of the definitive endoderm cells from 
visceral endoderm cells. 

[0118] Figure 18 is a diagram depicting the TGFp family of ligands and receptors. 
Factors activating AR Smads and BR Smads are useful in the production of definitive endoderm 
from human embryonic stem cells (see, J Cell Physiol 187:265-76). 

[0119] Figure 19 is a bar chart showing the induction of SOX17 expression over time 
as a result of treatment with individual and combinations of TGFp factors. 

[0120] Figure 20 is a bar chart showing the increase in SOXn"" cell number with time 
as a result of treatment with combinations of TGFp factors. 

[0121] Figure 21 is a bar chart showing induction of SOX 17 expression over time as a 
result of treatment with combinations of TGFp factors. 

[0122] Figure 22 is a bar chart showing that Activin A induces a dose-dependent 
increase in SOX 17^ cell number. 

[0123] Figure 23 is a bar chart showing ttiat addition of Wnt3a to Activin A and 
Activin B treated cultures increases SOX17 expression above the levels induced by Activin A and 
Activin B alone. 

[0124] Figures 24A-C are bar charts showing differentiation to definitive endoderm is 
enhanced in low FBS conditions. Treatment of hESCs with activins A and B in media containing 
2% FBS (2AA) yields a 2-3 times greater level of SOX 17 expression as compared to the same 
treatment in 10% FBS media (lOAA) (panel A). Induction of the definitive endoderm marker 
MDCLl (panel B) is also affected in the same way and the suppression of AFP (visceral endoderm) 
(panel C) is greater in 2% FBS than in 10% FBS conditions. 
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[0125] Figures 25A-D are micrographs which show SOXIT*^ cells are dividing in 
culture. SOX 17 immunoreactive cells are present at the differentiating edge of an hESC colony (C, 
D) and are labeled with proliferating cell nuclear antigen (PCNA) ^anel B) yet are not co-labeled 
with OCT4 (panel C). In addition, clear mitotic figures can be seen by DAPI labeling of nuclei in 
both SOXl?"" cells (arrows) as well as OCT4^ undifferentiated hESCs (arrowheads) (D). 

[0126] Figure 26 is a bar chart showing the relative expression level of CXCR4 in 
differentiating hESCs under various media conditions. 

[0127] Figures 27A-D are bar charts that show how a panel of defmitive endoderm 
markers share a very similar pattern of expression to CXCR4 across the same differentiation 
treatments displayed in Figure 26. 

[0128] Figures 28A-E are bar charts showing how markers for mesoderm 
(BRACHYURY, MOXl), ectoderm (SOXl, ZICl) and visceral endoderm (SOX7) exhibit an 
inverse relationship to CXCR4 expression across the same treatments displayed in Figure 26. 

[0129] Figxires 29A-F are micrographs that show the relative difference in SOX 17 
immunoreactive cells across three of the media conditions displayed in Figures 26-28. 

[0130] Figures 30A-C are flow cytometry dot plots that demonstrate the increase in 
CXCR4'^ cell number with increasing concentration of activin A added to the differentiation media. 

[0131] Figures 31A-D are bar charts that show the CXCR4^ cells isolated from the 
high dose activin A treatment (A100-CX+) are even further enriched for defmitive endoderm 
markers than the parent population (A 100). 

[0132] Figure 32 is a bar chart showing gene expression from CXCR4^ and CXCR4' 
cells isolated using fluorescence-activated cell sorting (FACS) as well as gene expression in the 
parent populations. This demonstrates that the CXCR4^ cells contain essentially all the CXCR4 
gene expression present in each parent population and the CXCR4" populations contain very little 
or no CXCR4 gene expression. 

[0133] Figures 33A-D are bar charts that demonstrate the depletion of mesoderm 
(BRACHYURY, MOXl), ectoderm (ZICl) and visceral endoderm (SOX7) gene expression in the 
CXCR4-f cells isolated from the high dose activin A treatment which is already suppressed in 
expression of these non-definitive endoderm markers. 

[0134] Figures 34A-M are bar charts showdng the expression patterns of marker genes 
that can be used to identify defmitive endoderm cells. The expression analysis of defmitive 
endoderm markers, FGF17, VWF, CALCR, FOXQl, CMKORl and CRIPl is shown in panels G- 
L, respectively. The expression analysis of previously described lineage marking genes, SOX17, 
SOX7, SOX17/SOX7, TM, ZICl, and MOXl is shown in panels A-F, respectively. Panel M 
shows the expression analysis of CXCR4. With respect to each of panels A-M, the column labeled 
hESC indicates gene expression from purified human embryonic stem cells; 2NF indicates cells 
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treated with 2% FBS, no activin addition; O.IAIOO indicates cells treated with 0.1% FBS, 100 
ng/ml Activin A; lAlOO indicates cells treated with 1% FBS, 100 ng/ml Activin A; and 2A100 
indicates cells treated with 2% FBS, 100 ng/ml Activin A. 

Detailed Description 

[0135] A crucial stage in early human development termed gastrulation occurs 2-3 
weeks after fertilization. Gastrulation is extremely significant because it is at this time that the 
three primary germ layers are first specified and organized (Lu et al., 2001; Schoenwolf and Smith, 
2000). The ectoderm is responsible for the eventual formation of the outer coverings of the body 
and the entire nervous system whereas the heart, blood, bone, skeletal muscle and other connective 
tissues are derived fi-om the mesoderm. Definitive endoderm is defined as the germ layer that is 
responsible for formation of the entire gut tube which includes the esophagus, stomach and small 
and large intestines, and the organs which derive fi-om the gut tube such as the lungs, liver, thymus, 
parathyroid and thyroid glands, gall bladder and pancreas (Grapin-Botton and Melton, 2000; 
Kimelman and Griffin, 2000; Tremblay et al., 2000; Wells and Melton, 1999; Wells and Melton, 
2000). A very important distinction should be made between the definitive endoderm and the 
completely separate lineage of cells termed primitive endoderm. The primitive endoderm is 
primarily responsible for formation of extra-embryonic tissues, mainly the parietal and visceral 
endoderm portions of the placental yolk sac and the extracellular matrix material of Reichert's 
membrane. 

10136] During gastrulation, the process of definitive endoderm formation begins with 
a cellular migration event in which mesendoderm cells (cells competent to form mesoderm or 
endoderm) migrate through a structure called the primitive streak. Definitive endoderm is derived 
from cells, which migrate through the anterior portion of the streak and through the node (a 
specialized structure at the anterior-most region of the streak). As migration occurs, definitive 
endoderm populates first the most anterior gut tube and culminates with the formation of the 
posterior end of the gut tube. 

[0137] In vivo analyses of the formation of definitive endoderm, such as the studies in 
Zebrafish ^ndXenopus by Conlon et al., 1994; Feldman et al., 1998; Zhou et al., 1993; Aoki et al., 
2002; Dougan et al., 2003; Tremblay et al., 2000; Vincent et al., 2003; Alexander et al., 1999; 
Alexander and Stainier, 1999; Kikuchi et al., 2001; Hudson et al., 1997 and in mouse by Kanai- 
Azuma et al,, 2002 lay a foundation for how one might attempt to approach the development of a 
specific germ layer cell type in the culture dish using human embryonic stem cells. There are two 
aspects associated with in vitro ESC culture that pose major obstacles in the attempt to recapitulate 
development in the culture dish. First, organized germ layer or organ structures are not produced. 
The majority of germ layer and organ specific genetic markers will be expressed in a 
heterogeneous fashion in the differentiating hESC culture system. Therefore it is difficult to 
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evaluate formation of a specific tissue or cell type due to this lack of organ specific boundaries. 
Almost all genes expressed in one cell type within a particular germ layer or tissue type are 
expressed in other cells of different germ layer or tissue types as well. Without specific boundaries 
there is considerably less means to assign gene expression specificity with a small sample of 1-3 
genes. Therefore one must examine considerably more genes, some of which should be present as 
well as some that should not be expressed in the particular cell type of the organ or tissue of 
interest. Second, the timing of gene expression patterns is crucial to movement down a specific 
developmental pathway. 

[0138] To further complicate matters, it should be noted that stem cell differentiation 
in vitro is rather asynchronous, likely considerably more so than in vivo. As such, one group of 
cells may be expressing genes associated with gastrulation, while another group may be starting 
final differentiation. Furthermore, manipulation of hESC monolayers or embryoid bodies (EBs) 
with or without exogenous factor application may result in profound differences with respect to 
overall gene expression pattern and state of differentiation. For these reasons, the application of 
exogenous factors must be timed according to gene expression patterns within a heterogeneous cell 
mixture in order to efficiently move the culture down a specific differentiation pathway. It is also 
beneficial to consider the morphological association of the cells in the culture vessel. The ability 
to uniformly influence hESCs when formed into so called embryoid bodies may be less optimal 
than hESCs grown and differentiated as monolayers and or hESC colonies in the culture vessel. 

[0139] As an effective way to deal with the above-mentioned problems of 
heterogeneity and asynchrony, some embodiments of the present invention contemplate combining 
a method for differentiating cells with a method for the enrichment, isolation and/or purification of 
intermediate cell types in the differentiation pathway. 

[0140] Embodiments of the present invention relate to novel, defined processes for the 
production of definitive endoderm cells in cultvire by differentiating pluripotent cells, such as stem 
cells into multipotent definitive endoderm cells. As used herein, "multipotenf ' or "multipotent 
cell" refers to a cell type that can give rise to a limited number of other particular cell types. As 
described above, definitive endoderm cells do not differentiate into tissues produced from 
ectoderm or mesoderm, but rather, differentiate into the gut tube as well as organs that are derived 
from the gut tube. In certain preferred embodiments, the definitive endoderm cells are derived 
from hESCs. Such processes can provide the basis for efficient production of human endodermal 
derived tissues such as pancreas, liver, lung, stomach, intestine and thyroid. For example, 
production of definitive endoderm may be the first step in differentiation of a stem cell to a 
functional insulin-producing p-celL To obtain usefiil quantities of insulin-producing p-cells, high 
efficiency of differentiation is desirable for each of the differentiation steps that occur prior to 
reaching the pancreatic islet/p-cell fate. Since differentiation of stem cells to definitive endoderm 



-16- 



o 



o 



cells represents perhaps the earliest step towards the production of functional pancreatic islet/p- 
cells (as shown in Figure 1), high efficiency of differentiation at this step is particularly desirable. 

[0141] In view of the desirability of efficient differentiation of pluripotent cells to 
definitive endoderm cells, some aspects of the present invention relate to in vitro methodology that 
results in approximately 50-80% conversion of pluripotent cells to definitive endoderm cells. 
Typically, such methods encompass the application of culture and growth factor conditions in a 
defined and ten^orally specified fashion. Further enrichment of the cell population for definitive 
endoderm cells can be achieved by isolation and/or purification of the definitive endoderm cells 
from other cells in the population by using a reagent that specifically binds to definitive endoderm 
cells. As such, aspects of the present invention relate to definitive endoderm cells as well as 
methods for producing and isolating and/or purifying such cells. 

[0142] In order to determine the amount of definitive endoderm cells in a cell culture 
or cell population, a method of distinguishing this cell type from the other cells in the culture or in 
the population is desirable. Accordingly, certain embodiments of the present invention relate to 
cell markers whose presence, absence and/or relative expression levels are specific for definitive 
endoderm and methods for detecting and determining the expression of such markers. As- used 
herein, "expression" refers to the production of a material or substance as well as the level or 
amount of production of a material or substance. Thus, determining the expression of a specific 
marker refers to detecting either the relative or absolute amount of the marker that is expressed or 
simply detecting the presence or absence of the marker. As used herein, "marker" refers to any 
molecule that can be observed or detected. For example, a marker can include, but is not limited 
to, a nucleic acid, such as a transcript of a specific gene, a polypeptide product of a gene, a non- 
gene product polypeptide, a glycoprotein, a carbohydrate, a glycolipd, a lipid, a lipoprotein or a 
small molecule. 

[0143] In some embodiments of the present invention, the presence, absence and/or 
level of expression of a marker is determined by quantitative PCR (Q-PCR). For example, the 
amount of transcript produced by certain genetic markers, such as SOX 17, CXCR4, OCT4, AFP, 
TM, SPARC, SOX7, MDCLl, GATA4, HNF3b, GSC, FGF17, VWF, CALCR, FOXQl, CMKORl, 
CRIPl and other markers described herein is determined by quantitative Q-PCR. In other 
embodiments, immunohistochemistry is used to detect the proteins expressed by the above- 
mentioned genes. In still other embodiments, Q-PCR and immunohistochemical techniques are 
both used to identify and determine the amount or relative proportions of such markers. 

[0144] By using methods, such as those described above, to determine the expression 
of one or more appropriate markers, it is possible to identify definitive endoderm cells, as well as 
determine the proportion of definitive endoderm cells in a cell culture or cell population. For 
example, in some embodiments of the present invention, the definitive endoderm cells or cell 
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populations that are produced express the SOX17 and/or the CXCR4 gene at a level of about 2 
orders of magnitude greater than non-definitive endoderm cell types or cell populations. In other 
embodiments, the definitive endoderm cells or cell populations that are produced express the 
SOX 17 and/or the CXCR4 gene at a level of more than 2 orders of magnitude greater than non- 
definitive endoderm cell types or cell populations. In still other embodiments, the definitive 
endoderm cells or cell populations that are produced express one or more of the markers selected 
fi-om the group consisting of SOX17, CXCR4, GSC, FGF17, VWF, CALCR, FOXQl, CMKORl 
and CRIPl at a level of about 2 or more than 2 orders of magnitude greater than non-definitive 
endoderm cell types or cell populations. 

[0145] Further aspects of the present invention relate to cell cultures comprising 
definitive endoderm as well as cell populations enriched in definitive endoderm cells. As such, 
certain embodiments relate to cell cultures which comprise definitive endoderm cells, wherein at 
least about 50-80% of the cells in culture are definitive endoderm cells. A preferred embodiment 
relates to cells cultures comprising human cells, wherein at least about 50-80% of the human cells 
in culture are definitive endoderm cells. Because the efficiency of the differentiation procedure 
can be adjusted by modifying certain parameters, which include but are not limited to, cell growth 
conditions, growth factor concentrations and the timing of culture steps, the differentiation 
procedures described herein can result in about 5%, about 10%, about 15%, about 20%, about 
25%, about 30%, about 35%, about 40%, about 45%, about 50%, about 55%, about 60%, about 
65%, about 70%, about 75%, about 80%, about 85%, about 90%, about 95%, or greater than about 
95% conversion of pluripotent cells to definitive endoderm. In other embodiments of the present 
invention, conversion of a pluripotent cell population, such as a stem cell population, to 
substantially pure definitive endoderm cell population is contemplated. 

[0146] The compositions and methods described herein have several useful features. 
For example, the cell cultures and cell populations comprising definitive endoderm as well as the 
methods for producing such cell cultures and cell populations are useful for modeling the early 
stages of human development. Furthermore, the compositions and methods described herein can 
also serve for therapeutic intervention in disease states, such as diabetes mellitus. For example, 
since definitive endoderm serves as the source for only a limited number of tissues, it can be used 
in the development of pure tissue or cell types. 

PRODUCTION OF DEFINITIVE ENDODERM FROM PLURIPOTENT CELLS 

[0147] The definitive endoderm cell cultures and compositions comprising definitive 
endoderm cells that are described herein can be produced from pluripotent cells, such as embryonic 
stem cells. As used herein, "embryonic" refers to a range of developmental stages of an organism 
beginning with a single zygote and ending with a multicellular structure that no longer comprises 
pluripotent or totipotent cells other than developed gametic cells. In addition to embryos derived 
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by gamete fusion, the term "embryonic" refers to embryos derived by somatic cell nuclear transfer. 
A preferred method for deriving definitive endoderm cells utilizes human embryonic stem cells 
(hESC) as the starting material for definitive endoderm production. The embryonic stem cells used 
in this method can be cells that originate from the morula, embryonic inner cell mass or those 
obtained from embryonic gonadal ridges. Human stem cells can be maintained in culture in a 
pluripotent state without substantial differentiation using methods that are known in the art. Such 
methods are described, for example, in US Patent Nos. 5,453,357, 5,670,372, 5,690,926 5,843,780, 
6,200,806 and 6,251,671 the disclosures of which are incorporated herein by reference in their 
entireties. 

[0148] In some embodiments of the methods described herein, hESCs are maintained 
on a feeder layer. In such embodiments, any feeder layer which allows hESCs to be maintained in 
a pluripotent state can be used in the methods described herein. One commonly used feeder layer 
for the cultivation of human embryonic stem cells is a layer of mouse fibroblasts. More recently, 
hxunan fibroblast feeder layers have been developed for use in the cultivation of hESCs (see US 
Patent Application No. 2002/0072117, the disclosure of which is incorporated herein by reference 
in its entirety). Alternative embodiments of the methods described herein permit the maintenance 
of pluripotent hESC without the use of a feeder layer. Such methods have been described in US 
Patent Application No. 2003/0175956, the disclosure of which is incorporated herein by reference 
in its entirety. 

[0149] The human embryonic stem cells used herein can be maintained in culture 
either with or without serum. In some embodiments, serum replacement is used. In other 
embodiments, serum free culture techniques, such as those described in US Patent Application No. 
2003/0190748, the disclosure of which is incorporated herein by reference in its entirety, are used. 

[0150] Stem cells are maintained in culture in a pluripotent state by routine passage 
until it is desired that they be differentiated into definitive endoderm. In some embodiments, 
differentiation to definitive endoderm is achieved by providing to the stem cell culture a growth 
factor of the TGFp superfamily in an amount sufficient to promote differentiation to definitive 
endoderm. Growth factors of the TGFp superfamily which are useful for the production of 
definitive endoderm are selected from the Nodal/Activin or BMP subgroups. In some 
embodiments of the differentiation methods described herein, the growth factor is selected from the 
group consisting of Nodal, Activin A, Activin B and BMP4. Additionally, the growth factor 
Wnt3a and other Wnt family members are useful for the production of definitive endoderm cells. 
In certain embodiments of the present invention, combinations of any of the above-mentioned 
growth factors can be used. 

[0151] With respect to some of the embodiments of differentiation methods described 
herein, the above-mentioned growth factors are provided to the cells so that the growth factors are 
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present in the cultures at concentrations sufficient to promote differentiation of at least a portion of 
the stem cells to definitive endoderm. hi some embodiments of the present invention, the above- 
mentioned growth factors are present in the cell culture at a concentration of at least about 5 ng/ml, 
at least about 10 ng/ml, at least about 25 hg/ml, at least about 50 ng/ml, at least about 75 ng/ml, at 
least about 100 ng/ml, at least about 200 ng/ml, at least about 300 ng/ml, at least about 400 ng/ml, 
at least about 500 ng/ml, at least about 1000 ng/ml, at least about 2000 ng/ml, at least about 3000 
ng/ml, at least about 4000 ng/ml, at least about 5000 ng/ml or more than about 5000 ng/ml. 

[0152] In certain embodiments of the present invention, the above-mentioned growth 
factors are removed from the cell culture subsequent to their addition. For example, the growth 
factors can be removed within about one day, about two days, about three days, about four days, 
about five days, about six days, about seven days, about eight days, about nine days or about ten 
days after their addition. In a preferred embodiment, the growth factors are removed about four 
days after their addition. 

[0153] Cultures of definitive endoderm cells can be grown in medium containing 
reduced serum or no serum. In certain embodiments of the present invention, serum concentrations 
can range from about 0.05% v/v to about 20% v/v. For example, in certain embodiments, the 
serum concentration of the medium can be less than about 0.05% (v/v), less than about 0.1% (v/v), 
less than about 0.2% (v/v), less than about 0.3% (v/v), less than about 0.4% (v/v), less than about 
0.5% (v/v), less than about 0.6% (v/v), less than about 0.7% (v/v), less than about 0.8% (v/v), less 
than about 0.9% (v/v), less than about 1% (v/v), less than about 2% (v/v), less than about 3% (v/v), 
less than about 4% (v/v), less than about 5% (v/v), less than about 6% (v/v), less than about 7% 
(v/v), less than about 8% (v/v), less than about 9% (v/v), less than about 10% (v/v), less than about 
15% (v/v) or less than about 20% (v/v). In some embodiments, definitive endoderm cells are 
grown without serum. In other embodiments, definitive endoderm cells are grown with serum 
replacement. In still other embodiments, definitive endoderm cells are grown in the presence of 
B27. In such embodiments, the concentration of B27 supplement can range from about 0.2% v/v to 
about 20% v/v. 

[0154] The progression of the hESC culture to definitive endoderm can be monitored 
by determining the expression of markers characteristic of definitive endoderm. In some 
embodiments, the expression of certain markers are determined by detecting the presence or 
absence of the marker. Alternatively, the expression of certain markers can determined by 
measuring the level at which the marker is present in the cells of the cell culture or cell population. 
In such embodiments, the measurement of marker expression can be qualitative or quantitative. 
One method of quantitating the expression markers that are produced by marker genes is through 
the use of quantitative PGR (Q-PCR). Methods of performing Q-PCR are well known in the art. 
Other methods which are known in the art can also be used to quantitate marker gene expression. 
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For example, the expression of a marker gene product can be detected by using antibodies specific 
for the marker gene product of interest. In some embodiments of the present invention, the 
expression of marker genes characteristic of definitive endoderm as well as the lack of significant 
expression of marker genes characteristic of hESCs and other cell types is determined. 

[0155] As described further in the Examples below, a reliable marker of definitive 
endoderm is the SOX 17 gene. As such, the definitive endoderm cells produced by the methods 
described herein express the SOX 17 marker gene, thereby producing the SOX 17 gene product. 
Other markers of definitive endoderm are MDCLl, GATA4, HNF3b, GSC, FGF17, VWF, CALCR, 
FOXQl, CMKORl and CRIPl. In some embodiments of the present invention, definitive 
endoderm cells express the SOX 17 marker gene at a level higher than that of the SOX7 marker 
gene, which is characteristic of primitive and visceral endoderm (see Table 1). Additionally, in 
some embodiments, expression of the SOX 17 marker gene is higher than the expression of the 
OCT4 marker gene, which is characteristic of hESCs. In other embodiments of the present 
invention, definitive endoderm cells express the SOX 17 marker gene at a level higher than that of 
the AFP, SPARC or Thrombomodulin (TM) marker genes. In certain embodiments of the present 
invention, the SOX17-expressing definitive endoderm cells produced by the methods described 
herein do not express significant levels or amounts of PDXl (PDXl -negative). 

[0156] Another marker of definitive endoderm is the CXCR4 gene. The CXCR4 gene 
encodes a cell surface chemokine receptor whose ligand is the chemoattractant SDF-1. The 
principal roles of the CXCR4 receptor-bearing cells in the adult are believed to be the migration of 
hematopoetic cells to the bone marrow, lymphocyte trafficking and the differentiation of various B 
cell and macrophage blood cell lineages [Kim, C, and Broxmeyer, H. J. Leukocyte Biol. 65, 6-15 
(1999)]. The CXCR4 receptor also functions as a coreceptor for the entry of HIV-1 into T-cells 
[Feng, Y., et al. Science, 272, 872-877 (1996)], In an extensive series of studies carried out by 
[McGrath, K.E. et al. Dev. Biology 213, 442-456 (1999)], the expression of the chemokine receptor 
CXCR4 and its unique ligand, SDF-1 [Kim, C, and Broxmyer, H., J. Leukocyte Biol. 65, 6-15 
(1999)], were delineated during early development and adult life in the mouse. The CXCR4/SDF1 
interaction in development became apparent when it was demonstrated that if either gene was 
disrupted in transgenic mice [Nagasawa et al. Nature, 382, 635-638 (1996)], Ma, Q,, et al 
Immunity, 10, 463-471 (1999)] it resulted in late embryonic lethality. McGrath et al. demonstrated 
that CXCR4 is the most abimdant chemokine receptor messenger RNA detected during early 
gastrulating embryos (E7.5) using a combination of RNase protection and in situ hybridization 
methodologies. In the gastrulating embryo, CXCR4/SDF-1 signaling appears to be mainly 
involved in inducing migration of primitive-streak germlayer cells and is expressed on definitive 
endoderm, mesoderm and extraembryonic mesoderm present at this time. In E7.2-7.8 mouse 
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embryos, CXCR4 and alpha-fetoprotein are mutually exclusive indicating a lack of expression in 
visceral endoderm [McGrath, K.E. et al. Dev. Biology 213, 442-456 (1999)]. 

(0157] In some embodiments of the present invention, the definitive endoderm cells 
produced by the methods described herein express the CXCR4 marker gene. In other 
embodiments, the definitive endoderm cells produced by the methods described herein express the 
CXCR4 marker gene as well as other markers of definitive endoderm including, but not limited to, 
S0X17, MIXLl, GATA4, HNF3b, GSC, FGF17, VWF, CALCR, FOXQl, CMKORl and CRIPl. 
In some embodiments of the present invention, definitive endoderm cells express the CXCR4 
marker gene at a level higher than that of the SOX7 marker gene. Additionally, in some 
embodiments, expression of the CXCR4 marker gene is higher than the expression of the OCT4 
marker gene. In other embodiments of the present invention, definitive endoderm cells express the 
CXCR4 marker gene at a level higher than that of the AFP, SPARC or Thrombomodulin (TM) 
marker genes. In certain embodiments of the present invention, the CXCR4-expressing definitive 
endoderm cells produced by the methods described herein do not express significant levels or 
amountsj)f PDXl (PDXl -negative). 

[0158] It will be appreciated that expression of CXCR4 in endodermal cells does not 
preclude the expression of SOX17. Accordingly, in some embodiments of the present invention, 
definitive endoderm cells are those that express both the SOX 17 and CXCR4 marker genes at a 
level higher than that of the SOX7 marker gene. Additionally, in some embodiments, the 
expression of both the SOX17 and CXCR4 marker genes is higher than the expression of the OCT4 
marker gene. In other embodiments of the present invention, definitive endoderm cells express 
both the SOX 17 and the CXCR4 marker genes at a level higher than that of the AFP, SPARC or 
Thrombomodulin (TM) marker genes. In certain embodiments of the present invention, the 
SOX17/CXCR4-expressing definitive endoderm cells produced by the methods described herein do 
not express significant levels or amounts of PDXl (PDXl -negative). 

[0159] It will be appreciated that SOX17 and/or CXCR4 marker expression is induced 
over a range of different levels in definitive endoderm cells depending on the differentiation 
conditions. As such, in some embodiments of the present invention, the expression of the SOX 17 
marker and/or the CXCR4 marker in definitive endoderm cells or cell populations is at least about 
2-fold higher to at least about 10,000-fold higher than the expression of the SOX 17 marker and/or 
the CXCR4 marker in non-definitive endoderm cells or cell populations, for example pluripotent 
stem cells. In other embodiments of the present invention, the expression of the SOX17 marker 
and/or the CXCR4 marker in definitive endoderm cells or cell populations is at least about 4-fold 
higher, at least about 6-fold higher, at least about 8-fold higher, at least about 10-fold higher, at 
least about 15-fold higher, at least about 20-fold higher, at least about 40-fold higher, at least about 
80-fold higher, at least about 100-fold higher, at least about 150-fold higher, at least about 200-fold 
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higher, at least about 500-fold higher, at least about 750-fold higher, at least about 1000-fold 
higher, at least about 2500-fold higher, at least about 5000-fold higher, at least about 7500-fold 
higher or at least about 10,000-fold higher than the expression of the SOX17 marker and/or the 
CXCR4 marker in non-definitive endoderm cells or cell populations, for example pluripotent stem 
cells. In some embodiments, the expression of the SOX17 marker and/or CXCR4 marker in 
definitive endoderm cells or cell populations is infinitely higher than the expression of the SOX 17 
marker and/or the CXCR4 marker in non-definitive endoderm cells or cell populations, for example 
pluripotent stem cells. 

[0160] It will be appreciated that in some embodiments of the present invention, the 
expression of markers selected from the group consisting of GATA4, MDCLl, HNF3b, GSC, 
FGF17, VWF, CALCR, FOXQl, CMKORl and CRIPl in definitive endoderm cells or cell 
populations is increased as compared to the expression of GATA4, MIXLl, HNF3b, GSC, FGF17, 
VWF, CALCR, FOXQl, CMKORl and CRIPl in non-definitive endoderm cells or cell 
populations. 

[0161] It will also be appreciated that there is a range of differences between the 
expression level of the SOX17 marker and the expression levels of the OCT4, SPARC, AFP, TM 
and/or SOX7 markers in definitive endoderm cells. Similarly, there exists a range of differences 
between the expression level of the CXCR4 marker and the expression levels of the OCT4, 
SPARC, AFP, TM and/or SOX7 markers in definitive endoderm cells. As such, in some 
embodiments of the present invention, the expression of the SOX 17 marker or the CXCR4 marker 
is at least about 2-fold higher to at least about 10,000-fold higher than the expression of OCT4, 
SPARC, AFP, TM and/or SOX7 markers. In other embodiments of the present invention, the 
expression of the SOX17 marker or the CXCR4 marker is at least about 4-fold higher, at least 
about 6-fold higher, at least about 8-fold higher, at least about 10-fold higher, at least about 15-fold 
higher, at least about 20-fold higher, at least about 40-fold higher, at least about 80-fold higher, at 
least about 100-fold higher, at least about 150-fold higher, at least about 200-fold higher, at least 
about 500-fold higher, at least about 750-fold higher, at least about 1000-fold higher, at least about 
2500-fold higher, at least about 5000-fold higher, at least about 7500-fold higher or at least about 
10,000-fold higher than the expression of OCT4, SPARC, AFP, TM and/or SOX7 markers. In 
some embodiments, OCT4, SPARC, AFP, TM and/or SOX7 markers are not significantly 
expressed in definitive endoderm cells. 

[0162] It will be appreciated that in some embodiments of the present invention, the 
expression of markers selected from the group consisting of GATA4, MDCLl, HNF3b, GSC, 
FGF17, VWF, CALCR, FOXQl, CMKORl and CRIPl in defmitive endoderm cells is increased as 
compared to the expression of OCT4, SPARC, AFP, TM and/or SOX7 in definitive endoderm 
cells. 
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COMPOSITIONS COMPRISING DEFINrnVE ENDODERM 

[0163] Some aspects of the present invention relate to compositions, such as cell 
populations and cell cultures, that comprise both pluripotent cells, such as stem cells, and definitive 
endojdCTm cells. For example, using the methods described herein, compositions comprising 
mixtures of hESCs and definitive endoderm cells can be produced. In some embodiments, 
compositions comprising at least about 5 definitive endoderm cells for about every 95 pluripotent 
cells are produced. In other embodiments, compositions comprising at least about 95 definitive 
endoderm cells for about every 5 pluripotent cells are produced. Additionally, compositions 
comprising other ratios of definitive endoderm cells to pluripotent cells are contemplated. For 
example, compositions comprising at least about 1 definitive endoderm cell for about every 
1,000,000 pluripotent cells, at least about 1 definitive endoderm cell for about every 100,000 
pluripotent cells, at least about 1 definitive endoderm cell for about every 10,000 pluripotent cells, 
at least about 1 definitive endoderm cell for about every 1000 pluripotent cells, at least about 1 
definitive endoderm cell for about every 500 pluripotent cells, at least about 1 definitive endoderm 
cell for about every 100 pluripotent cells, at least about 1 definitive endoderm cell for about every 
10 pluripotent cells, at least about 1 definitive endoderm cell for about every 5 pluripotent cells, at 
least about 1 definitive endoderm cell for about every 2 pluripotent cells, at least about 2 definitive 
endoderm cells for about every 1 pluripotent cell, at least about 5 definitive endoderm cells for 
about every 1 pluripotent cell, at least about 10 definitive endoderm cells for about every 1 
pluripotent cell, at least about 20 definitive endoderm cells for about every 1 pluripotent cell, at 
least about 50 definitive endoderm cells for about every 1 pluripotent cell, at least about 100 
definitive endoderm cells for about every 1 pluripotent cell, at least about 1000 definitive 
endoderm cells for about every 1 pluripotent cell, at least about 10,000 definitive endoderm cells 
for about every 1 pluripotent cell, at least about 100,000 definitive endoderm cells for about every 
1 pluripotent cell and at least about 1,000,000 definitive endoderm cells for about every 1 
pluripotent cell are contemplated. In some embodiments of the present invention, the pluripotent 
cells are human pluripotent stem cells. In certain embodiments the stem cells are derived fi-om a 
morula, the inner cell mass of an embryo or the gonadal ridges of an embryo. In certain other 
embodiments, the pluripotent cells are derived fi-om the gondal or germ tissues of a multicellular 
structure that has developed past the embryonic stage. 

[0164] Some aspects of the present invention relate to cell cultures or cell populations 
comprising from at least about 5% definitive endoderm cells to at least about 95% definitive 
endoderm cells. In some embodiments the cell cultures or cell populations comprise mammalian 
cells. In preferred embodiments, the cell cultures or cell populations comprise human cells. For 
example, certain specific embodiments relate to cell cultures comprising human cells, wherein 
from at least about 5% to at least about 95% of the human cells are definitive endoderm cells. 
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Other embodiments of the present invention relate to cell cultures comprising human cells, wherein 
at least about 5%, at least about 10%, at least about 15%, at least about 20%, at least about 25%, at 
least about 30%, at least about 35%, at least about 40%, at least about 45%, at least about 50%, at 
least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at 
least about 80%, at least about 85%, at least about 90% or greater than 90% of the human cells are 
definitive endoderm cells. 

[0165] Further embodiments of the present invention relate to compositions, such as 
cell cultures or cell populations, comprising human cells, such as human definitive endoderm cells, 
wherein the expression of either the SOX17 or the CXCR4 marker is greater than the expression of 
the OCT 4, SPARC, alpha-fetoprotein (AFP), Thrombomodulin (TM) and/or SOX7 marker in at 
least about 5% of the human cells. In other embodiments, the expression of either the SOX 17 or 
the CXCR4 marker is greater than the expression of the OCT4, SPARC, AFP, TM and/or SOX7 
marker in at least about 10% of the human cells, in at least about 15% of the human cells, in at 
least about 20% of the human cells, in at least about 25% of the human cells, in at least about 30% 
of the human cells, in at least about 35% of the human cells, in at least about 40% of the human 
cells, in at least about 45% of the human cells, in at least about 50% of the human cells, in at least 
about 55% of the human cells, in at least about 60% of the human cells, in at least about 65% of the 
human cells, in at least about 70% of the human cells, in at least about 75% of the human cells, in 
at least about 80% of the human cells, in at least about 85% of the human cells, in at least about 
90% of the human cells, in at least about 95% of the human cells or in greater than 95% of the 
human cells. 

[0166] It will be appreciated that some embodiments of the present invention relate to 
compositions, such as cell cultures or cell populations, comprising human cells, such as human 
definitive endoderm cells, wherein the expression of one or more markers selected from the group 
consisting of GATA4, MIXLl, HNFSb, GSC, FGF17, VWF, CALCR, FOXQl, CMKORl and 
CRIPl is greater than the expression of the OCT4, SPARC, AFP, TM and/or SOX7 markers in 
from at least about 5% to greater than at least about 95% of the human cells, 

[0167] Still other embodiments of the present invention relate to compositions, such 
as cell cultures or cell populations, comprising human cells, such as human definitive endoderm 
cells, wherein the expression both the SOX 17 and the CXCR4 marker is greater than the 
expression of the OCT4, SPARC, AFP, TM and/or SOX7 marker in at least about 5% of the human 
cells. In other embodiments, the expression of both the SOX 17 and the CXCR4 marker is greater 
than the expression of the OCT4, SPARC, AFP, TM and/or SOX7 marker in at least about 10% of 
the human cells, in at least about 15% of the human cells, in at least about 20% of the human cells, 
in at least about 25% of the human cells, in at least about 30% of the human cells, in at least about 
35% of the human cells, in at least about 40% of the human cells, in at least about 45% of the 
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human cells, in at least about 50% of the human cells, in at least about 55% of the human cells, in 
at least about 60% of the human cells, in at least about 65% of the human cells, in at least about 
70% of the human cells, in at least about 75% of the human cells, in at least about 80% of the 
human cells, in at least about 85% of the human cells, in at least about 90% of the hxmian cells, in 
at least about 95% of the human cells or in greater than 95% of the human cells. 

[0168] It will be appreciated that some embodiments of the present invention relate to 
compositions, such as cell cultures or cell populations, comprising human cells, such as human 
definitive endoderm cells, wherein the expression of the GATA4, MIXLl, HNF3b, GSC, FGF17, 
VWF, CALCR, FOXQl, CMKORl and CRIPl markers is greater than the expression of the 
OCT4, SPARC, AFP, TM and/or SOX7 markers in from at least about 5% to greater than at least 
about 95% of the human cells. 

[0169] Additional embodiments of the present invention relate to compositions, such 
as cell cultures or cell populations, comprising mammalian endodermal cells, such as human 
endoderm cells, wherein the expression of either the SOX17 or the CXCR4 marker is greater than 
the expression of the OCT4, SPARC, AFP, TM and/or SOX7 marker in at least about 5% of the 
endodermal cells. In other embodiments, the expression of either the SOX 17 or the CXCR4 
marker is greater than the expression of the OCT4, SPARC, AFP, TM and/or SOX7 marker in at 
least about 10% of the endodermal cells, in at least about 15% of the endodermal cells, in at least 
about 20% of the endodermal cells, in at least about 25% of the endodermal cells, in at least about 
30% of the endodermal cells, in at least about 35% of the endodermal cells, in at least about 40% 
of the endodermal cells, in at least about 45% of the endodermal cells, in at least about 50% of the 
endodermal cells, in at least about 55% of the endodermal cells, in at least about 60% of the 
endodermal cells, in at least about 65% of the endodermal cells, in at least about 70% of the 
endodermal cells, in at least about 75% of the endodermal cells, in at least about 80% of the 
endodermal cells, in at least about 85% of the endodermal cells, in at least about 90% of the 
endodermal cells, in at least about 95% of the endodermal cells or in greater than 95% of the 
endodermal cells. 

[0170] It will be appreciated that some embodiments of the present invention relate to 
compositions, such as cell cultures or cell populations comprising mammalian endodermal cells, 
wherein the expression of one or more markers selected from the group consisting of GATA4, 
MIXLl, HNF3b, GSC, FGF17, VWF, CALCR, FOXQl, CMKORl and CRIPl is greater than the 
expression of the OCT4, SPARC, AFP, TM and/or SOX7 markers in from at least about 5% to 
greater than at least about 95% of the endodermal cells. 

[0171] Still other embodiments of the present invention relate to compositions, such 
as cell cultures or cell populations, comprising mammalian endodermal cells, such as human 
endodermal cells, wherein the expression both the SOX 17 and the CXCR4 marker is greater than 
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the expression of the OCT4, SPARC, AFP, TM and/or SOX7 marker in at least about 5% of the 
endodefmal cells. In other embodiments, the expression of both the SOX17 and the CXCR4 
marker is greater than the expression of the OCT4, SPARC, AFP, TM and/or SOX7 marker in at 
least about 10% of the endodermal cells, in at least about 15% of the endodermal cells, in at least 
about 20% of the endodermal cells, in at least about 25% of the endodermal cells, in at least about 
30% of the endodermal cells, in at least about 35% of the endodermal cells, in at least about 40% 
of the endodermal cells, in at least about 45% of the endodermal cells, in at least about 50% of the 
endodermal cells, in at least about 55% of the endodermal cells, in at least about 60% of the 
endodermal cells, in at least about 65% of the endodermal cells, in at least about 70% of the 
endodermal cells, in at least about 75% of the endodermal cells, in at least about 80% of the 
endodermal cells, in at least about 85% of the endodermal cells, in at least about 90% of the 
endodermal cells, in at least about 95% of the endodermal cells or in greater than 95% of the 
endodermal cells. 

[0172] It will be appreciated that some embodiments of the present invention relate to 
compositions, such as cell cultures or cell populations comprising mammalian endodermal cells, 
wherein the expression of the GATA4, MIXLl, HNF3b, GSC, FGF17, VWF, CALCR, FOXQl, 
CMKORl and CRIPl markers is greater than the expression of the OCT4, SPARC, AFP, TM 
and/or SOX7 markers in from at least about 5% to greater than at least about 95% of the 
endodermal cells. 

[0173] Using the methods described herein, compositions comprising definitive 
endoderm cells substantially free of other cell types can be produced. With respect to cells in cell 
cultures or in cell populations, the term "substantially free of means that the specified cell type of 
which the cell culture or cell population is free, is present in an amount of less than about 5% of 
the total number of cells present in the cell culture or cell population. In some embodiments of the 
present invention, the definitive endoderm cell populations or cell cultures produced by the 
methods described herein are substantially free of cells that significantly express the OCT4, SOX7, 
AFP, SPARC, TM, ZICl or BRACK marker genes. 

[0174] In one embodiment of the present invention, a description of a definitive 
endoderm cell based on the expression of marker genes is, SOX17 high, MIXLl high, AFP low, 
SPARC low, Thrombomodulin low, SOX7 low, CXCR4 high. 

ENRICHMENT. ISOLATION AND/OR PURIFICATION OF DEFDsnTIVE ENDODERM 

[0175] With respect to additional aspects of the present invention, definitive 
endoderm cells can be enriched, isolated and/or purified by using an affinity tag that is specific for 
such cells. Examples of affinity tags specific for definitive endoderm cells are antibodies, ligands 
or other binding agents that are specific to a marker molecule, such as a polypeptide, that is present 
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on the cell surface of definitive endoderm cells but which is not substantially present on other cell 
types that would be found in a cell culture produced by the methods described herein. In some 
embodiments, an antibody which binds to CXCR4 is used as an affinity tag for the enrichment, 
isolation or purification of definitive endoderm cells. In other embodiments, the chemokdne SDF-1 
or other molecules based on SDF-1 can also be used as affinity tags. Such molecules include, but 
not limited to, SDF-1 Augments, SDF-1 fiisions or SDF-1 mimetics. 

[0176] Methods for making antibodies and using them for cell isolation are known in 
the art and such methods can be implemented for use with the antibodies and cells described 
herein. In one embodiment, an antibody which binds to CXCR4 is attached to a magnetic bead 
then allowed to bind to definitive endoderm cells in a cell culture which has been enzymatically 
treated to reduce intercellular and substrate adhesion. The cell/antibody/bead complexes are then 
exposed to a movable magnetic field which is used to separate bead-bound definitive endoderm 
cells from unbound cells. Once the definitive endoderm cells are physically separated from other 
cells in culture, the antibody binding is disrupted and the cells are replated in appropriate tissue 
culture medium. 

[0177] Embodiments of the present invention contemplate additional methods for 
obtaining enriched, isolated or purified definitive endoderm cell cultures or populations. For 
example, in some embodiments, the CXCR4 antibody is incubated with definitive endoderm- 
containing cell culture that has been treated to reduce intercellular and substrate adhesion. The 
cells are then washed, centrifiiged and resuspended. The cell suspension is then incubated with a 
secondary antibody, such as an FITC-conjugated antibody that is capable of binding to the primary 
antibody. The cells are then washed, centrifiiged and resuspended in buffer. The cell suspension is 
then analyzed and sorted using a fluorescence activated cell sorter (FACS). CXCR4-positive cells 
are collected separately from CXCR4-negative cells, thereby resulting in the isolation of such cell 
types. If desired, the isolated cell compositions can be further purified by using an alternate 
affinity-based method or by additional rounds of sorting using the same or different markers that 
are specific for definitive endoderm. 

[0178] In other embodiments of the present invention, definitive endoderm is 
enriched, isolated and/or purified using a ligand or other molecule that binds to CXCR4. In some 
embodiments, the molecule is SDF-1 or a fragment, fiision or mimetic thereof 

[0179] In preferred embodiments, definitive endoderm cells are enriched, isolated 
and/or purified from other non-definitive endoderm cells after the stem cell cultures are induced to 
differentiate towards the definitive endoderm lineage. It will be appreciated that the above- 
described enrichment, isolation and purification procedures can be used with such cultures at any 
stage of differentiation. 
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[0180] In addition to the procedures just described, definitive endoderm cells may 
also be isolated by other techniques for cell isolation. Additionally, definitive endoderm cells may 
also be enriched or isolated by methods of serial subculture in growth conditions which promote 
the selective survival or selective expansion of said definitive endoderm cells. 

[0181] Using the methods described herein, enriched, isolated and/or purified 
populations of definitive endoderm cells and or tissues can be produced in vitro from pluripotent 
cell cultures or cell populations, such as stem cell cultures or populations, which have undergone at 
least some differentiation. In some embodiments, the cells undergo random differentiation. In a 
preferred embodiment, however, the cells are directed to differentiate primarily into definitive 
endoderm. Some preferred enrichment, isolation and/or purification methods relate to the in vitro 
production of definitive endoderm from human embryonic stem cells. Using the methods 
described herein, cell populations or cell cultures can be enriched in definitive endoderm content 
by at least about 2- to about 1000-fold as compared to untreated cell populations or cell cultures. 
In some embodiments, definitive endoderm cells can be enriched by at least about 5- to about 500- 
fold as compared to untreated cell populations or cell cultures. In other embodiments, defmitive 
endoderm cells can be enriched from at least about 10- to about 200-fold as compared to untreated 
cell populations or cell cultures. In still other embodiments, definitive endoderm cells can be 
enriched from at least about 20- to about 100-fold as compared to untreated cell populations or cell 
cultures. In yet other embodiments, definitive endoderm cells can be enriched from at least about 
40- to about 80-fold as compared to untreated cell populations or cell cultures. In certain 
embodiments, defmitive endoderm cells can be enriched from at least about 2- to about 20-fold as 
compared to untreated cell populations or cell cultures. 

[0182] Having generally described this invention, a further understanding can be 
obtained by reference to certain specific examples which are provided herein for purposes of 
illustration only, and are not intended to be limiting, 

EXAMPLES 

[0183] Many of the examples below describe the use of pluripotent human cells. 
Methods of producing pluripotent human cells are well known in the art and have been described 
numerous scientific publications, including U.S. Patent Nos. 5,453,357, 5,670,372, 5,690,926, 
6,090,622, 6,200,806 and 6,251,671 as well as U.S. Patent Application Publication No. 
2004/0229350, the disclosures of which are incorporated herein by reference in their entireties. 

EXAMPLE 1 
Human ES cells 

[0184] For our studies of endoderm development we employed human embryonic 
stem cells, which are pluripotent and can divide seemingly indefinitely in culture while maintaining 
a normal karyotype. ES cells were derived from the 5-day-old embryo inner cell mass using either 
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immunological or mechanical methods for isolation. In particular, the human embryonic stem cell 
line hESCyt-25 was derived from a supernumerary frozen embryo from an in vitro fertilization 
cycle following informed consent by the patient. Upon thawing the hatched blastocyst was plated 
on mouse embryonic fibroblasts (MEF), in ES medium (DMEM, 20% FBS, non essential amino 
acids, beta-mercaptoethanol, ITS supplement). The embryo adhered to the culture dish and after 
iapproximately two weeks, regions of undifferentiated hESCs were transferred to new dishes with 
MEFs. Transfer was accomplished with mechanical cutting and a brief digestion with dispase, 
followed by mechanical removal of the cell clusters, washing and re-plating. Since derivation, 
hESCyt-25 has been serially passaged over 100 times. We employed the hESCyt-25 human 
embryonic stem cell line as our starting material for the production of definitive endoderm. 

[0185] It will be appreciated by those of skill in the art that stem cells or other 
pluripotent cells can also be used as starting material for the differentiation procedures described 
herein. For example, cells obtained from embryonic gonadal ridges, which can be isolated by 
methods known in the art, can be used as pluripotent cellular starting material. 

EXAMPLE 2 
hESCyt-25 Characterization 
[0186] The human embryonic stem cell line, hESCyt-25 has maintained a normal 
morphology, karyotype, growth and self-renewal properties over 18 months in culture. This cell 
line displays strong immunoreactivity for the OCT4, SSEA-4 and TRA-1-60 antigens, all of which, 
are. characteristic of undifferentiated hESCs and displays alkaline phosphatase activity as well as a 
morphology identical to other established hESC lines. Furthermore, the human stem cell line, 
hESCyt-25, also readily forms embryoid bodies (EEs) when cultured in suspension. As a 
demonstration of its pluripotent nature, hESCyT-25 differentiates into various cell types that 
represent the three principal germ layers. Ectoderm production was demonstrated by Q-PCR for 
ZICl as well as immunocytochemistry (ICC) for nestin and more mature neuronal markers. 
Immunocytochemical staining for P-EH tubulin was observed in clusters of elongated cells, 
characteristic of early neurons. Previously, we treated EEs in suspension with retinoic acid, to 
induce differentiation of pluripotent stem cells to visceral endoderm (VE), an extra-embryonic 
lineage. Treated cells expressed high levels of a-fetoprotein (AFP) and SOX7, two markers of VE, 
by 54 hours of treatment. Cells differentiated in monolayer expressed AFP in sporadic patches as 
demonstrated by immunocytochemical staining. As will be described below, the hESCyT-25 cell 
line was also capable of forming definitive endoderm, as validated by real-time quantitative 
polymerase chain reaction (Q-PCR) and immunocytochemistry for SOX17, in the absence of AFP 
expression. To demonstrate differentiation to mesoderm, differentiating EBs were analyzed for 
Brachyury gene expression at several time points. Brachyury expression increased progressively 
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over the course of the experiment. In view of the foregoing, the hESCyT-25 line is pluripotent as 
shown by the ability to form cells representing the three germ layers. 

EXAMPLES 
Production of SOX17 Antibody 

[0187] A primary obstacle to the identification of definitive endoderm in hESC 
cultures is the lack of appropriate tools. We therefore undertook the production of an antibody 
raised against human SOX17 protein. 

[0188] The marker SOX 17 is expressed throughout the definitive endoderm as it 
forms during gastrulation and its expression is maintained in the gut tube (although levels of 
expression vary along the A-P axis) until aroimd the onset of organogenesis. SOX17 is also 
expressed in a subset of extra-embryonic endoderm cells. No expression of this protein has been 
observed in mesoderm or ectoderm. It has now been discovered that SOX 17 is an appropriate 
marker for the definitive endoderm lineage when used in conjunction with markers to exclude 
extra-embryonic lineages. 

[0189] As described in detail herein, the SOX17 antibody was utilized to specifically 
examine effects of various treatments and differentiation procedures aimed at the production of 
SOX 17 positive definitive endoderm cells. Other antibodies reactive to AFP, SPARC and 
Thrombomodulin were also employed to rule out the production of visceral and parietal endoderm 
(extra-embryonic endoderm). 

[0190] In order to produce an antibody against SOX17, a portion of the human 
SOX 17 cDNA (SEQ ID NO: 1) corresponding to amino acids 172-414 (SEQ ID NO: 2) in the 
carboxyterminal end of the SOX 17 protein (Figure 2) was used for genetic immunization in rats at 
the antibody production company, GENOVAC (Freiberg, Germany), according to procedures 
developed there. Procedures for genetic immunization can be found in US Patent Nos. 5,830,876, 
5,817,637, 6,165,993 and 6,261,281 as well as Intemational Patent Application Publication Nos. 
WOOO/29442 and W099/13915, the disclosures of which are incorporated herein by reference in 
their entireties. 

[0191] Other suitable methods for genetic immunization are also described in the non- 
patent literature. For example, Barry et al. describe the production of monoclonal antibodies by 
genetic immunization in Biotechniques 16: 616-620, 1994, the disclosure of which is incorporated 
herein by reference in its entirety. Specific examples of genetic immunization methods to produce 
antibodies against specific proteins can be found, for example, in Costaglia et aL, (1998) Genetic 
immunization against the human thyrotropin receptor causes thyroiditis and allows production of 
monoclonal antibodies recognizing the native receptor, J. Immunol. 160: 1458-1465; Kilpatrick et 
al (1998) Gene gun delivered DNA-based immunizations mediate rapid production of murine 
monoclonal antibodies to the Flt-3 receptor, Hybridoma 17: 569-576; Schmolke et aL, (1998) 
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Identification of hepatitis G virus particles in human serum by E2-specific monoclonal antibodies 
generated by DNA immunization, J, Virol. 12: 4541-4545; Krasemann et al., (1999) Generation of 
monoclonal antibodies against proteins with an unconventional nucleic acid-based immimization 
strategy, J. Biotechnol 73: 119-129; and Ulivieri et al, (1996) Generation of a monoclonal 
antibody to a defined portion of the Heliobacter pylori vacuolating cytotoxin by DNA 
immunization, J. Biotechnol 51: 191-194, the disclosures of which are incorporated herein by 
reference in their entireties. 

[0192] SOX7 and SOX18 are the closest Sox family relatives to SOX17 as depicted in 
the relational dendrogram shown in Figure 3. We employed the human SOX7 polypeptide as a 
negative control to demonstrate that the SOX17 antibody is specific for SOX17 and does not react 
with its closest family member. In particular, to demonstrate that the antibody produced by genetic 
immunization is specific for SOX17, SOX7 and other proteins were expressed in human 
fibroblasts, , and then, analyzed for cross reactivity with the SOX 17 antibody by Western blot and 
ICC. For example, the following methods were utilized for the production of the SOX 17, SOX7 
and EGFP expression vectors, their transfection into human fibroblasts and analysis by Western 
blot. Expression vectors employed for the production of SOX 17, SOX7, and EGFP were pCMV6 
(OriGene Technologies, Inc., Rockville, MD), pCMV-SPORT6 (Invitrogen, Carlsbad, CA) and 
pEGFP-Nl (Clonetech, Palo Alto, CA), respectively. For protein production, telomerase 
immortalized MDX human fibroblasts were transientiy transfected with supercoiled DNA in the 
presence of Lipofectamine 2000 (Invitrogen, Carlsbad, CA). Total cellular lysates were collected 
36 hours post-transfection in 50 mM TRIS-HCl (pH 8), 150 mM NaCl, 0.1% SDS, 0.5% 
deoxycholate, containing a cocktail of protease inhibitors (Roche Diagnostics Corporation, 
Indianapolis, IN). Western blot analysis of 100 ng of cellular proteins, separated by SDS-PAGE on 
NuPAGE (4-12 % gradient polyacrylamide, Invitrogen, Carlsbad, CA), and transferred by electro- 
blotting onto PDVF membranes (Hercules, CA), were probed with a 1/1000 dilution of the rat 
SOX17 anti-serum in 10 mM TRIS-HCl (pH 8), 150 mM NaCl, 10% BSA, 0.05 % Tween-20 
(Sigma, St. Louis, MO), followed by Alkaline Phosphatase conjugated anti-rat IgG (Jackson 
ImmunoResearch Laboratories, West Grove, PA), and revealed through Vector Black Alkaline 
Phosphatase staining (Vector Laboratories, Burlingame, CA). The proteins size standard used was 
wide range color markers (Sigma, St. Louis, MO). 

[0193] In Figure 4, protein extracts made from human fibroblast cells that were 
transiently transfected with SOX 17, SOX7 or EGFP cDNA's were probed on Western blots with 
the SOX 17 antibody. Only the protein extract from hSOX17 transfected cells produced a band of 
~51Kda which closely matched the predicted 46 Kda molecular weight of the human SOX 17 
protein. There was no reactivity of the SOX17 antibody to extracts made from either human SOX7 
or EGFP transfected cells. Furthermore, the SOX17 antibody clearly labeled the nuclei of human 
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fibroblast cells transfected with the hSOX17 expression construct but did not label cells transfected 
with EGFP alone. As such, the SOX17 antibody exhibits specificity by ICC. 

EXAMPLE 4 

Validation of SOX 17 Antibody as a Marker of Definitive Endoderm 
[0194] As evidence that the SOX17 antibody is sjpecific for human SOX 17 protein 
and furthermore marks definitive endoderm, partially differentiated hESCs were co-labeled with 
SOX17 and AFP antibodies. It has been demonstrated that SOX17, SOX7, which is a closely 
related member of the SOX gene family subgroup F (Figure 3), and AFP are each expressed in 
visceral endoderm. However, AFP and SOX7 are not expressed in definitive endoderm cells at 
levels detectable by ICC, and thus, they be can employed as negative markers for bonifide 
definitive endoderm cells. It was shown that SOX 17 antibody labels populations of cells that exist 
as discrete groupings of cells or are intermingled with AFP positive cells, hi particular. Figure 5 A 
shows that small numbers of SOX17 cells were co-labeled with AFP; however, regions were also 
found where there were little or no AFP^ cells in the field of SOX17^ cells (Figure 5B). Similarly, 
since parietal endoderm has also been reported to express SOX 17, antibody co-labeling with 
SOX17 together with the parietal markers SPARC and/or Thrombomodulin (TM) can be used to 
identify the SOX 17^ cells which are parietal endoderm. As shown in Figures 6A-C, 
Thrombomodulin and SOX 17 co-labelled parietal endoderm cells were produced by random 
differentiation of hES cells. 

[0195] Li view of the above cell labelling experiments, the identity of a definitive 
endoderm cell can be established by the marker profile SOX17*^/AFP*7[TM*° or SPARC**"], In 
other words, the expression of the SOX 17 marker is greater than the expression of the AFP marker, 
which is characteristic of visceral endoderm, and the TM or SPARC markers, which are 
characteristic of parietal endoderm. Accordingly, those cells positive for SOX 17 but negative for 
AFP and negative for TM or SPARC are defmitive endoderm. 

[0196] As a further evidence of the specificity of the S0X17^/AFP*''/TM'7SPARC''* 
marker profile as predictive of definitive endoderm, SOX17 and AFP gene expression was 
quantitatively compared to the relative number of antibody labeled cells. As shown in Figure 7 A, 
hESCs treated with retinoic acid (visceral endoderm inducer), or Activin A (defmitive endoderm 
inducer), resulted in a 10-fold difference in the level of SOX17 mRNA expression. This result 
mirrored the 10-fold difference in SOX17 antibody-labeled cell number (Figure 7B). Furthermore, 
as shown in Figure 8 A, Activin A treatment of hESCs suppressed AFP gene expression by 6.8-fold 
in comparison to no treatment. This was visually reflected by a dramatic decrease in the number of 
AFP labeled cells in these cultures as shown in Figures 8B-C. To quantify this further, it was 
demonstrated that this approximately 7-fold decrease in AFP gene expression was the result of a 
similar 7-fold decrease in AFP antibody-labeled cell number as measured by flow cytometry 
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(Figures 9A-B). This result is extremely significant in that it indicates that quantitative changes in 
gene expression as seen by Q-PCR mirror changes in cell type specification as observed by 
antibody staining. 

[0197] Licubation of hESCs in the presence of Nodal family members (Nodal, Activin 
A and Activin B - NAA) resulted in a significant increase in SOX17 antibody-labeled cells over 
time. By 5 days of continuous activin treatment greater than 50% of the cells were labeled with 
SOX 17 (Figures lOA-F), There were few or no cells labeled with AFP after 5 days of activin 
treatment. 

[0198] In summary, the antibody produced against the carboxy-terminal 242 amino 
acids of the human SOX 17 protein identified human SOX 17 protein on Westem blots but did not 
recognize SOX7, it's closest Sox family relative. The SOX17 antibody recognized a subset of cells 
in differentiating hESC cultures that were primarily SOXH^AFP"*'" (greater than 95% of labeled 
cells) as well as a small percentage (< 5%) of cells that co-label for SOX 17 and AFP (visceral 
endoderm). Treatment of hESC cultures with activins resulted in a marked elevation of SOX 17 
gene expression as well as SOX 17 labeled cells and dramatically suppressed the expression of AFP 
mRNA and the number of cells labeled with AFP antibody. 

EXAMPLES 
O-PCR Gene Expression Assav 

[0199] In the following experiments, real-time quantitative RT-PCR (Q-PCR) was the 
primary assay used for screening the effects of various treatments on hESC differentiation. In 
particular, real-time measurements of gene expression were analyzed for multiple marker genes at 
multiple time points by Q-PCR. Marker genes characteristic of the desired as well as undesired 
cell types were evaluated to gain a better understanding of the overall dynamics of the cellular 
populations. The strength of Q-PCR analysis includes its extreme sensitivity and relative ease of 
developing the necessary markers, as the genome sequence is readily available. Furthermore, the 
extremely high sensitivity of Q-PCR permits detection of gene expression from a relatively small 
number of cells within a much larger population. In addition, the ability to detect very low levels 
of gene expression provides indications for "differentiation bias" within the population. The bias 
towards a particular differentiation pathway, prior to the overt differentiation of those cellular 
phenotypes, is unrecognizable using immunocytochemical techniques. For this reason, Q-PCR 
provides a method of analysis that is at least complementary and potentially much superior to 
immunocytochemical techniques for screening the success of differentiation treatments. 
Additionally, Q-PCR provides a mechanism by which to evaluate the success of a differentiation 
protocol in a quantitative format at semi-high throughput scales of analysis. 

[0200] The approach taken here was to perform relative quantitation using SYBR 
Green chemistry on a Rotor Gene 3000 instrument (Corbett Research) and a two-step -RT-PCR 
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format. Such an approach allowed for the baiiking of cDNA samples for analysis of additional 
marker genes in the future, thus avoiding variability in the reverse transcription efficiency between 
samples. 

. [0201] Primers were designed to lie over exon-exon boundaries or span introns of at 
least 800 bp when possible, as this has been empirically determined to eliminate amplification from 
contaminating genomic DNA. When marker genes were employed that do not contain introns or 
they possess pseudogenes, DNase I treatment of RNA samples was performed. 

[0202] We routinely used Q-PCR to measure the gene expression of multiple markers 
of target and non-target cell types in order to provide a broad profile description of gene expression 
in cell samples. The markers relevant for the early phases of hESC differentiation (specifically 
ectoderm, mesoderm, definitive endoderm and extra-embryonic endoderm) and for which validated 
primer sets are available are provided below in Table 1. The human specificity of these primer sets 
has also been demonstrated. This is an important fact since the hESCs were often grown on mouse 
feeder layers. Most typically, triplicate samples were taken for each condition and independently 
analyzed in duplicate to assess the biological variability associated with each quantitative 
determination. 

[0203] To generate PGR template, total RNA was isolated using RNeasy (Qiagen) and 
quantitated using RiboGreen (Molecular Probes). Reverse transcription from 350-500 ng of total 
RNA was carried out using the iScript reverse transcriptase kit (BioRad), which contains a mix of 
oligo-dT and random primers. Each 20 reaction was subsequently diluted up to 100 yiL total 
volume and 3 was used in each 10 j^L Q-PCR reaction containing 400 nM forward and reverse 
primers and 5 2X SYBR Green master mix (Qiagen). Two step cycling parameters were used 
employing a 5 second denature at 85-94''C (specifically selected according to the melting temp of 
the amplicon for each primer set) followed by a 45 second armeal/extend at eO^'C. Fluorescence 
data was collected during the last 15 seconds of each extension phase. A three point, 10-fold 
dilution series was used to generate the standard curve for each run and cycle thresholds (Ct*s) 
were converted to quantitative values based on this standard curve. The quantitated values for each 
sample were normalized to housekeeping gene performance and then average and standard 
deviations were calculated for triplicate samples. At the conclusion of PGR cycling, a melt curve 
analysis was performed to ascertain the specificity of the reaction. A single specific product was 
indicated by a single peak at the T^ appropriate for that PGR amplicon. In addition, reactions 
performed without reverse transcriptase served as the negative control and do not amplify. 

[0204] A first step in establishing the Q-PGR methodology was validation of 
appropriate housekeeping genes (HQs) in the experimental system. Since the HG was used to 
normalize across samples for the RNA input, RNA integrity and RT efficiency, it was of value that 
the HG exhibited a constant level of expression over time in all sample types in order for the 
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normalization to be meaningful. We measured the expression levels of Cyclophilin G, 
hypoxanthine phosphoribosyltransferase 1 (HPRT), beta-2-microglobulin, hydroxymethylbiane 
synthase (HMBS), TATA-binding protein (TBP), and glucoronidase beta (GUS) in differentiating 
hESCs. Our results indicated that beta'2-microglobulin expression levels increased over the course 
of differentiation and therefore we excluded the use of this gene for normalization. The other 
genes exhibited consistent expression levels over time as well as across treatments. We routinely 
used both Cyclophilin G and GUS to calculate a normalization factor for all samples. The use of 
multiple HGs simultaneously reduces the variability inherent to the normalization process and 
increases the reliability of the relative gene expression values. 

[0205] After obtaining genes for use in normalization, Q-PCR was then utilized to 
determine the relative gene expression levels of many marker genes across samples receiving 
different experimental treatments. The marker genes employed have been chosen because they 
exhibit enrichment in specific populations representative of the early germ layers and in particular 
have focused on sets of genes that are differentially expressed in definitive endoderm and extra- 
embryonic endoderm. These genes as well as their relative enrichment profiles are highlighted in 
Table 1. 



TABLE 1 



liiliHiaiifeiiiSi 


















Endoderm 


SOX17 


definitive, visceral and parietal endoderm 




MLXLl 


endoderm and mesoderm 




GATA4 


definitive and primitive endoderm 




HNF3b 


definitive endoderm and primitive endoderm, mesoderm, neural 






plate 




GSC 


endoderm and mesoderm 


Extra-embryonic 


SOX7 


visceral endoderm 




AFP 


visceral endoderm, liver 




SPARC 


parietal endoderm 




TM 


parietal endoderm/trophectoderm 


Ectoderm 


ZICl 


neural tube, neural progenitors 


Mesoderm 


BRACK 


nascent mesoderm 



[0206] Since many genes are expressed in more than one germ layer it is useful to 
quantitatively compare expression levels of many genes within the same experiment. SOX17 is 
expressed in definitive endoderm and to a smaller extent in visceral and parietal endoderm. SOX7 
and AFP are expressed in visceral endoderm at this early developmental time point. SPARC and 
TM are expressed in parietal endoderm and Brachyury is expressed in early mesoderm. 

[0207] Definitive endoderm cells were predicted to express high levels of SOX 17 
mRNA and low levels of AFP and SOX7 (visceral endoderm), SPARC (parietal endoderm) and 
Brachyury (mesoderm). In addition, ZICl was used here to further rule out induction of early 
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ectoderm. Finally, GATA4 and HNF3b were expressed in both definitive and extra-embryonic 
endoderm, and thus, correlate with SOX17 expression in definitive endod^m (Table 1). A 
representative experiment is shown in Figures 11-14 which demonstrates how the marker genes 
described in Table 1 correlate with each other among the various samples, thus highlighting 
specific patterns of differentiation to definitive endoderm and extra-embryonic endoderm as well as 
to mesodermal and neural cell types. 

[0208] In view of the above data it is clear that increasing doses of activin resulted in 
increasing SOX 17 gene expression. Further this SOX 17 expression predominantly represented 
definitive endoderm as opposed to extra-embryonic endoderm. This conclusion stems fi-om the 
observation that SOX17 gene expression was inversely correlated with AFP, SOX7, and SPARC 
gene expression. 

EXAMPLE 6 

Directed Differentiation of Human ES Cells to Definitive Endoderm 
[0209] Human ES cell cultures will randomly differentiate if they are cultured imder 
conditions that do not actively maintain their undifferentiated state. This heterogeneous 
differentiation results in production of extra-embryonic endoderm cells comprised of both parietal 
and visceral endoderm (AFP, SPARC and SOX7 expression) as well as early ectodermal and 
mesodermal derivatives as marked by ZICl and Nestin (ectoderm) and Brachyury (mesoderm) 
expression. Definitive endoderm cell appearance has not traditionally been examined or specified 
for lack of specific antibody markers in ES cell cultures. As such, and by default, early definitive 
endoderm production in ES cell cultures has not been well studied. Since no good antibody 
reagents for definitive endoderm cells have been available, most of the characterization has focused 
on ectoderm and extra-embryonic endoderm. Overall, there are significantly greater numbers of 
extra-embryonic and neurectodermal cell types in comparison to SOX 17^ definitive endoderm cells 
in randomly differentiated ES cell cultures. 

10210] As undifferentiated hESC colonies expand on a bed of fibroblast feeders the 
edges of the colony take on alternative morphologies that are distinct fi-om those cells residing 
within the interior of the colony. Many of these outer edge cells can be distinguished by their less 
uniform, larger cell body morphology and by the expression of higher levels of OCT4. It has been 
described that as ES cells begin to differentiate they alter the levels of OCT4 expression up or 
down relative to undifferentiated ES cells. Alteration of OCT4 levels above or below the 
undifferentiated threshold may signify the initial stages of differentiation away fi-om the pluripotent 
state. 

[0211] When undifferentiated colonies were examined by SOX 17 
immunocytochemistry, occasionally small 10-15-cell clusters of SOX17-positive cells were 
detected at random locations on the periphery and at the junctions between imdifferentiated ESC 
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colonies. As noted above, these scattered pockets of outer colony edges appeared to be some of the 
first cells to differentiate away from the classical ESC morphology as the colony expanded in size 
and became more crowded. Younger, smaller folly undifferentiated colonies (< 1mm; 4-5 days 
old) showed no SOX 17 positive cells within or at the edges of the colonies while older, larger 
colonies (1-2 mm diameter, > 5days old) had sporadic isolated patches of SOX 17 positive, AFP 
negative cells at the periphery of some colonies or in regions interior to the edge that were 
differentiated away from classical hESC morphology described previously. Given that this was the 
first development of an effective SOX 17 antibody, definitive endoderm cells generated in such 
early "undifferentiated" ESC cultures have never been previously demonstrated. 

[0212] Based on negative correlations of SOX 17 and SPARC gene expression levels 
by Q-PCR, the vast majority of these SOX 17 positive, AFP negative cells will be negative for 
parietal markers by antibody co-labeling. This was specifically demonstrated for TM-expressing 
parietal endoderm cells as shown in Figures 15A-B. Exposure to Nodal factors Activin A and B 
resulted in a dramatic decrease in the intensity to TM expression and the number of TM positive 
cells. By triple labeling using SOX 17, AFP and TM antibodies on an activin treated culture, 
clusters of SOX17 positive cells which were also negative for AFP and TM were observed (Figures 
16A-D). These are the first cellular demonstrations of SOX17 positive definitive endoderm cells in 
differentiating ESC cultures (Figures 16A-D and 17). 

[0213] With the SOX 17 antibody and Q-PCR tools described above we have explored 
a number of procedures capable of efficiently programming ESCs to become SOX17*^/AFP*° / 
SPARC/TM*° definitive endoderm cells. We applied a variety of differentiation protocols aimed at 
increasing the number and proliferative capacity of these cells as measured at the population level 
by Q-PCR for SOX 17 gene expression and at the level of individual cells by antibody labeling of 
SOXl 7 protein. 

[0214] We were the first to analyze and describe the effect of TGFp family growth 
factors, such as Nodal/activin/BMP, for use in creating definitive endoderm cells fi-om embryonic 
stem cells in in vitro cell cultvires. In typical experiments, Activin A, Activin B, BMP or 
combinations of these growth factors were added to cultures of undifferentiated human stem cell 
line hESCyt-25 to begin the differentiation process. 

[0215] As shown in Figure 19, addition of Activin A at 100 ng/ml resulted in a 19- 
fold induction of SOXl 7 gene expression vs. undifferentiated hESCs by day 4 of differentiation. 
Adding Activin B, a second member of the activin family, together with Activin A, resulted in a 
37-fold induction over undifferentiated hESCs by day 4 of combined activin treatment. Finally, 
adding a third member of the TGF|3 family from the Nodal/Activin and BMP subgroups, BMP4, 
together with Activin A and Activin B, increased the fold induction to 57 times that of 
undifferentiated hESCs (Figure 19). When SOX17 induction with activins and BMP was 
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compared to no factor medium controls 5-, 10-, and IS-fold inductions resulted at the 4-day time 
point. By five days of triple treatment with Activins A, B and BMP, SOX17 was induced more 
than 70 times higher than hESCs. These data indicate tiiat higher doses and longer treatment times 
of the Nodal/activin TGFp family members results in increased expression of SOX17. 

[0216] Nodal and related molecules Activin A, B and BMP facilitate the expression of 
SOX17 and definitive endoderm formation in vivo or in vitro. Furthermore, addition of BMP 
results in an improved SOX17 induction possibly through the further induction of Cripto, the 
Nodal co-receptor. 

[0217] We have demonstrated that the combination of Activins A and B together with 
BMP4 result in additive increases in SOX17 induction and hence definitive endoderm formation. 
BMP4 addition for prolonged periods (>4 days), in combination with Activin A and B may induce 
S0X17 in parietal and visceral endoderm as well as definitive endoderm. In some embodiments of 
the present invention, it is therefore valuable to remove BMP4 fi-om the treatment within 4 days of 
addition. 

[0218] To determine the effect of TGFp factor treatment at the individual cell level, a 
time course of TGFp factor addition was examined using SOX 17 antibody labeling. As previously 
shown in Figures lOA-F, there was a dramatic increase in the relative number of SOX 17 labeled 
cells over time. The relative quantification (Figure 20) shows more than a 20-fold increase in 
SOX17-labeled cells. This result indicates that both the numbers of cells as well SOX17 gene 
expression level are increasing with time of TGFp factor exposure. As shown in Figure 21, after 
four days of exposure to Nodal, Activin A, Activin B and BMP4, the level of SOX17 induction 
reached 168-fold over undifferentiated hESCs. Figure 22 shows that the relative number of 
S0X17-positive cells was also dose responsive. Activin A doses of 100 ng/mL or more were 
capable of potently inducing SOX 17 gene expression and cell number. 

[0219] In addition to the TGFp family members, the Wnt family of molecules may 
play a role in specification and/or maintenance of definitive endoderm. The use of Wnt molecules 
was also beneficial for the differentiation of hESCs to definitive endoderm as indicted by the 
increased SOX 17 gene expression in samples that were treated with activins plus Wnt3a over that 
of activins alone (Figure 23). 

[0220] All of the experiments described above were performed using tissue culture 
medium containing 10% serum with added factors. Surprisingly, we discovered that the 
concentration of serum had an effect on the level of SOX 17 expression in the presence of added 
activins as shown in Figures 24A-C. When serum levels were reduced fi-om 10% to 2%, SOX17 
expression tripled in the presence of Activins A and B. 

[0221] Finally, we demonstrated that activin induced SOX 17^ cells divide in culture 
as depicted in Figures 25A-D. The arrows show cells labeled with SOX17/PCNA/DAPI that are in 
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mitosis as evidenced by the PCNA/DAPI-labeled mitotic plate pattern and the phase contrast 
mitotic profile, 

EXAMPLE? 

Chemokine receptor 4 rCXCR4) expression correlates with markers 
for definitive endoderm and not markers for mesoderm, ectoderm or visceral endoderm 
10222] As described above, ESCs can be induced to differentiate to the definitive 
endoderm germ layer by the application of cytokines of the TGFp family and more specifically of 
the activin/nodal subfamily. Additionally, we have shown that the proportion of fetal bovine serum 
(FBS) in the differentiation culture medium effects the efficiency of definitive endoderm 
differentiation fi-om ESCs. This effect is such that at a given concentration of activin A in the 
medium, higher levels of FBS will inhibit maximal differentiation to definitive endoderm. In the 
absence of exogenous activin A, differentiation of ESCs to the definitive endoderm lineage is very 
inefficient and the FBS concentration has much milder effects on the differentiation process of 
ESCs. 

[0223] In these experiments, hESCs were differentiated by growing in RPMI medium 
(Invitrogen, Carlsbad, CA; cat# 61870-036) supplemented with 0.5%, 2.0% or 10% FBS and either 
with or without 100 ng/mL activin A for 6 days. In addition, a gradient of FBS ranging from 0.5% 
to 2.0% over the first three days of differentiation was also used in conjunction with 100 ng/mL of 
activin A. After the 6 days, replicate samples were collected from each culture condition and 
analyzed for relative gene expression by real-time quantitative PCR. The remaining cells were 
fixed for immunofluorescent detection of SOX 17 protein. 

[0224] The expression levels of CXCR4 varied dramatically across the 7 culture 
conditions used (Figure 26). In general, CXCR4 expression was high in activin A treated cultures 
(AlOO) and low in those which did not receive exogenous activin A (NF). In addition, among the 
A 100 treated cultures, CXCR4 expression was highest when FBS concentration was lowest. There 
was a remarkable decrease in CXCR4 level in the 10% FBS condition such that the relative 
expression was more in line with the conditions that did not receive activin A (NF). 

[0225] As described above, expression of the SOX 17, GSC, MIXLl, and HNF3p 
genes is consistent with the characterization of a cell as definitive endoderm. The relative 
expression of these four genes across the 7 differentiation conditions mirrors that of CXCR4 
(Figures 27A-D). This demonstrates that CXCR4 is also a marker of definitive endoderm. 

[0226] Ectoderm and mesoderm lineages can be distinguished from definitive 
endoderm by their expression of various markers. Early mesoderm expresses the genes Brachyury 
and MOXl while nascent neuro-ectoderm expresses SOXl and ZICl. Figures 28A-D demonstrate 
that the cultures which did not receive exogenous activin A were preferentially enriched for 
mesoderm and ectoderm gene expression and that among the activin A treated cultures, the 10% 
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FBS condition also had increased levels of mesoderm and ectoderm marker expression. These 
pattems of expression were inverse to that of CXCR4 and indicated that CXCR4 was not highly 
expressed in mesoderm or ectoderm derived from ESCs at this developmental time period. 

[0227] Early during mammalian development, differentiation to extra-embryonic 
lineages also occurs. Of particular relevance here is the differentiation of visceral endoderm that 
shares the expression of many genes in common with definitive endoderm, including SOX17. To 
distinguish definitive endoderm from extra-embryonic visceral endoderm one should examine a 
marker that is distinct between these two. SOX7 represents a marker that is expressed in the 
visceral endoderm but not in the definitive endoderm lineage. Thus, culture conditions that exhibit 
robust SOX17 gene expression in the absence of SOX7 expression are likely to contain definitive 
and not visceral endoderm. It is shown in Figure 28E that SOX7 was highly expressed in cultures 
that did not receive activin A, SOX7 also exhibited increased expression even in the presence of 
activin A when FBS was included at 10%. This pattern is the inverse of the CXCR4 expression 
pattem and suggests that CXCR4 is not highly expressed in visceral endoderm. 

[0228] The relative number of SOX 17 immunoreactive (SOX 17^) cells present in 
each of the differentiation conditions mentioned above was also determined. When hESCs were 
differentiated in the presence of high dose activin A and low FBS concentration (0.5% - 2.0%) 
80X17"^ cells were ubiquitously distributed throughout the culture. When high dose activin A was 
used but FBS was included at 10% (v/v), the SOX 17^ cells appeared at much lower frequency and 
always appeared in isolated clusters rather than evenly distributed throughout the culture (Figures 
29A and C as well as B and E). A further decrease in SOXlT cells was seen when no exogenous 
activin A was used. Under these conditions the SOXH"" cells also appeared in clusters and these 
clusters were smaller and much more rare than those found in the high activin A, low FBS 
treatment (Figure 29 C and F). These results demonstrate that the CXCR4 expression pattems not 
only correspond to definitive endoderm gene expression but also to the number of defmitive 
endoderm cells in each condition. 

EXAMPLES 

Differentiation conditions that enrich for definitive endoderm 
increase the proportion of CXCR4 positive cells 
[0229] The dose of activin A also effects the efficiency at which defmitive endoderm 
can be derived, from ESCs. This example demonstrates that increasing the dose of activin A 
increases the proportion of CXCR4^ cells in the culture. 

[0230] hESCs were differentiated in RPMI media supplemented with 0.5%-2% FBS 
(increased from 0.5% to 1.0% to 2.0% over the first 3 days of differentiation) and either 0, 10, or 
100 ng/mL of activin A. After 7 days of differentiation the cells were dissociated in PBS without 
Ca^^/Mg^^ containing 2% FBS and 2 mM (EDTA) for 5 minutes at room temperature. The cells 
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were filtered through 35 um nylon filters, counted and pelleted. Pellets were resuspended in a 
small volume of 50% human serum/50% normal donkey serum and incubated for 2 minutes on ice 
to block non-specific antibody binding sites. To this, 1 uL of mouse anti-CXCR4 antibody 
(Abeam, cat# ab 10403- 100) was added per 50 uL (containing approximately 10^ cells) and labeling 
proceeded for 45 minutes on ice. Cells were washed by adding 5 mL of PBS containing 2% human 
serum (buffer) and pelleted. A second wash with 5 mL of buffer was completed then cells were 
resuspended in 50 uL buffer per 10* cells. Secondary antibody (FETC conjugated donkey anti- 
mouse; Jackson ImmunoResearch, cat# 715-096-151) was added at 5 ug/mL final concentration 
and allowed to label for 30 minutes followed by two washes in buffer as above. Cells were 
resuspended at 5x10^ cells/mL in buffer and analyzed and sorted using a FACS Vantage (Beckton 
Dickenson) by the staff at the flow cj^ometry core facility (The Scripps Research Listitute). Cells 
were collected directly into RLT lysis buffer (Qiagen) for subsequent isolation of total RNA for 
gene expression analysis by real-time quantitative PCR, 

[0231] The number of CXCR4"*' cells as determined by flow cytometry were observed 
to increase dramatically as the dose of activin A was increased in the differentiation culture media 
(Figures 30A-C). The CXCR4^ cells were those falling within the R4 gate and this gate was set 
using a secondaiiy antibody-only control for which 0.2% of events were located in the R4 gate. The 
dramatically increased numbers of CXCR4^ cells correlates with a robust increase in definitive 
endoderm gene expression as activin A dose is increased (Figures 3 1 A-D). 

EXAMPLE 9 

Isolation of CXCR4 positive cells enriches for definitive endoderm gene 
expression and depletes cells expressing markers of mesoderm, ectoderm and visceral endoderm 

[0232] The CXCR4"' and CXCR4' cells identified in Example 8 above were collected 
and analyzed for relative gene expression and the gene expression of the parent populations was 
determined simultaneously. 

[0233] The relative levels of CXCR4 gene expression was dramatically increased with 
increasing dose of activin A (Figure 32). This correlated very well with the activin A dose- 
dependent increase of CXCR4'^ cells (Figures 30A-C). It is also clear that isolation of the CXCR4'' 
cells from each population accounted for nearly all of the CXCR4 gene expression in that 
population. This demonstrates the efficiency of the FACS method for collecting these cells. 

[0234] Gene expression analysis revealed that the CXCR4'*^ cells contain not only the 
majority of the CXCR4 gene expression, but they also contained other gene expression for markers 
of definitive endoderm. As shown in Figures 31 A-D, the CXCR4'' cells were fiirther enriched over 
the parent AlOO population for SOX17, GSC, HNF3B, and MDCLl. In addition, the CXCR4' 
fi-action contained very little gene expression for these definitive endoderm markers. Moreover, 
the CXCR4^ and CXCR4" populations displayed the inverse pattern of gene expression for markers 
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of mesoderm, ectoderm and extra-embryonic endodenn. Figures 33A-D shows that the CXCR4^ 
cells were depleted for gene expression of Brachyury, MOXl, ZICl, and SOX7 relative to the 
AlOO parent population. This AlOO parent population was already low in expression of these 
markers relative to the low dose or no activin A conditions. These results show that the isolation of 
CXCR4^ cells from hESCs differentiated in the presence of high activin A j^elds a population that 
is highly enriched for and substantially pure definitive endoderm. 

EXAMPLE 10 

Quantitation of Definitive Endoderm Cells in a Cell Population Using CXCR4 
[0235] To confirm the quantitation of the proportion of definitive endoderm cells 
present in a cell culture or cell population as determined previously herein and as determined in 
United States Provisional Patent Application No. 60/532,004, entitled DEFINITIVE ENDODERM, 
filed December 23, 2003, the disclosure of which is incorporated herein by reference in its entirety, 
cells expressing CXCR4 and other markers of definitive endoderm were analyzed by FACS. 

[0236] Using the methods such as those described in the above Examples, hESCs 
were differentiated to produce definitive endoderm. In particular, increase yield and purity 
expressed in differentiating cell cultures, the serum concentration of the medium was controlled as 
follows: 0.2% FBS on dayl, 1.0% FBS on day 2 and 2.0% FBS on days 3-6. Differentiated 
cultures were sorted by FACS using three cell surface epitopes, E-Cadherin, CXCR4, and 
Thrombomodulin. Sorted cell populations were then analyzed by Q-PCR to determine relative 
expression levels of markers for definitive and extraembryonic-endoderm as well as other cell 
types. CXCR4 sorted cells taken from optimally differentiated cultures resulted in the isolation of 
definitive endoderm cells that were >98% pure. 

[0237] Table 2 shows the results of a marker analysis for a definitive endoderm 
culture that was differentiated from hESCs using the methods described herein. 

Table 2 

Composition of Definitive Endoderm Cuitures 





Percent 


Percent 


Percent 


Percent 




of 


Definitive 


Extraembryonic 


hES 


Marker(s) 


culture 


Endoderm 


endododerm 


cells 


SOX17 


70-80 


100 






Thrombomodulin 


<2 


0 


75 




AFP 


<1 


0 


25 




CXCR4 


70-80 


100 


0 




ECAD 


10 


0 




100 


other (ECAD neg.) 


10-20 








Total 


100 


100 


100 


100 



[0238] In particular. Table 2 indicates that CXCR4 and SOX17 positive cells 
(endoderm) comprised from 70%-80% of the cells in the cell culture. Of these SOX17-expressing 
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cells, less than 2% expressed TM (parietal endodenn) and less than 1% expressed AFP (visceral 
endoderm). - After subtracting the proportion of TM-positive and AFP-positive cells (combined 
parietal and visceral aidoderm; 3% total) from the proportion of SOX17/CXCR4 positive cells, it 
can be seen that about 67% to about 77% of the cell culture was definitive endoderm. 
Approximately 10% of the cells were positive for E-Cadherin (ECAD), which is a marker for 
hESCs, and about 10-20% of the cells were of other cell types. 

[0239] We have discovered that the purity of definitive endoderm in the 
differentiating cell cultures that are obtained prior to FACS separation can be improved as 
conqsared to the above-described low serum procedure by maintaining the FBS concentration at 
<0.5% throughout the 5-6 day differentiation procedure. However, maintaining the cell culture at 
<0.5% throughout the 5-6 day differentiation procedure also results in a reduced number of total 
definitive endoderm cells that are produced, 

[0240] Definitive endoderm cells produced by methods described herein have been 
maintained and expanded in culture in the presence of activin for greater than 50 days without 
appreciable differentiation. In such cases, SOX17, CXCR4, MIXLl, GATA4, HNF3p expression 
is maintained over the culture period. Additionally, TM, SPARC, OCT4, AFP, SOX7, ZICI and 
BRACK were not detected in these cultures. It is likely that such cells can be maintained and 
expanded in culture for substantially longer than 50 days without appreciable differentiation. 

EXAMPLE 11 
Additional Marker of Definitive Endoderm Cells 

[0241] In the following experiment, RNA was isolated fi-om purified definitive 
endoderm and human embryonic stem cell populations. Gene expression was then analyzed by 
gene chip analysis of the RNA from each purified population. Q-PCR was also performed to 
further investigate the potential of genes expressed in definitive endoderm, but not in embryonic 
stem cells, as a marker for definitive endoderm. 

[0242] Human embryonic stem cells (hESCs) were maintained in DMEM/F12 media 
supplemented with 20% KnockOut Serum Replacement, 4 ng/mL recombinant human basic 
fibroblast growth factor (bFGF), 0.1 mM 2-mercaptoethanol, L-glutamine, non-essential amino 
acids and penicillin/streptomycin. hESCs were differentiated to definitive endoderm by culturing 
for 5 days in RPMI media supplemented with 100 ng/mL of recombinant human activin A, fetal 
bovine serum (FBS), and penicillin/streptomycin. The concentration of FBS was varied each day 
as follows: 0.1% (first day), 0.2% (second day), 2% (days 3-5). 

[0243] Cells were isolated by fluorescence activated cell sorting (FACS) in order to 
obtain purified populations of hESCs and definitive endoderm for gene expression analysis. 
Immuno-purification was achieved for hESCs using SSEA4 antigen (R&D Systems, cat# 
FAB1435P) and for definitive endoderm using CXCR4 (R&D Systems, cat# FAB170P). Cells 
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were dissociated using trypsin/EDTA (Invitrogen, cat# 25300-054), washed in phosphate buffered 
saline (PBS) containing 2% human senun and resuspraded in 100% human serum on ice for 10 
minutes to block non-specific binding. Staining was carried out for 30 minutes on ice by adding 
200 uL of phycoerythrin-conjugated antibody to 5 x 10^ cells in 800 uL human serum. Cells were 
washed twice with 8 mL of PBS buffer and resuspended in 1 mL of the same. FACS isolation was 
carried out by the core facility of The Scripps Research Institute using a FACS Vantage (BD 
Biosciences). Cells were collected directly into RLT lysis buffer and RNA was isolated by RNeasy 
according to the manufacturers instructions (Qiagen). ' 

[0244] Purified RNA was submitted in dupHcate to Expression Analysis (Durham, 
NC) for generation of the expression profile data using the Affymetrix platform and U133 Plus 2.0 
high-density oligonucleotide arrays. Data presented is a group comparison that identifies genes 
differentially expressed between the two populations, hESCs and definitive endoderm. Genes that 
exhibited a robust upward change in expression level over that found in hESCs were selected as 
new candidate markers that are highly characteristic of definitive endoderm. Select genes were 
assayed by Q-PCR, as described above, to verify the gene expression changes found on the gene 
chip and also to investigate the expression pattern of these genes during a time course of hESC 
differentiation. 

[0245] Figures 34A-M show the gene expression results for certain markers. Results 
are displayed for cell cultures analyzed 1, 3 and 5 days after the addition of 100 ng/ml activin A, 
CXCR4-expressing definitive endoderm cells purified at the end of the five day differentiation 
procedure (CXDE), and in purified human embryonic stem cells (HESC). A comparison of Figures 
34C and G-M demonstrates that the six marker genes, FGF17, VWF, CALCR, FOXQl, CMKORl 
and CRIPl, exhibit an expression pattern that is almost identical to each other and which is also 
identical to the pattern of expression of CXCR4 and SOX17/SOX7. As described previously, 
SOX 17 is expressed in both the definitive endoderm as well as in the SOX7-expressing extra- 
embryonic endoderm. Since SOX7 is not expressed in the definitive endoderm, the ratio of 
SOX17/SOX7 provides a reliable estimate of defmitive endoderm contribution to the SOX 17 
expression witnessed in the population as a whole. The similarity of panels G-L and M to panel C 
indicates that FGF17, VWF, CALCR, FOXQl, CMKORl and CRIPl are likely markers of 
definitive endoderm and that they are not significantly expressed in extra-embryonic endoderm 
cells. 

[0246] It will be appreciated that the Q-PCR results described herein can be fiirther 
confirmed by ICC. 

[0247] The methods, compositions, and devices described herein are presently 
representative of preferred embodiments and are exemplary and are not intended as limitations on 
the scope of the invention. Changes therein and other uses will occur to those skilled in the art 
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which are encompassed wifhin the spirit of the invention and are defined by the scope of the 
disclosure. Accordingly, it will be apparent to one skilled in the art that varying substitutions and 
modifications may be made to the invention disclosed herein without departing fi'om the scope and 
spirit of the invention. 

[0248] As used in the claims below and throughout this disclosure, by the phrase 
"consisting essentially of is meant including any elements listed after the phrase, and limited to 
other elements that do not interfere with or contribute to the activity or action specified in the 
disclosure for the listed elements. Thus, the phrase "consisting essentially of indicates that the 
listed elements are required or mandatory, but that other elements are optional and may or may not 
be present depending upon whether or not they affect the activity or action of the listed elements. 
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